Introduction


This book is definitely not a book on mathematics. It is a book on the use 
of symmetries, mainly described by the techniques of Lie groups and Lie 
algebras. Although no proofs of theorems and the like are given, except 
in special cases, the ideas are very firmly based on a lifetime of lecturing 
experience. 


Symmetries and Conservation Laws


You may already be familiar with the ideas of conserved quantities, such as 
charge in electromagnetism, but it will not hurt to go through this once more, 
and there may be students for whom it is quite new. Since we are dealing with 
elementary particles, we may as well think of conserved numbers carried on 
particles, and indeed we will start with the charge e on the proton. If we
consider the charge of the electron (−e), which carries electric currents, what
do we mean by "it is conserved" and what consequences might this have? We
might as well, for simplicity, start with the problem in classical physics and 
turn to quantum mechanics later. Well, the first thing is that it cannot simply 
vanish or appear. Of course it can vanish by having equal but opposite charges 
annihilate it (producing, for example, the photons of light), or it can appear 
in the reverse of this. All other conserved quantities such as energy, and 
linear and angular momentum must be conserved—in our picture carried on 
the photons. Already we see that this must happen at the same time and at 
the same spatial point, but this is natural when the charges are carried on the 
particles. 

You may well be familiar with the idea of conservation of charge being
associated with the four-divergence of the current carrying that charge. Calling
$j^\mu$ the current carried by an electron (of charge −e) we can write

$$\partial_\mu j^\mu = 0. \qquad (1.1)$$

Then we have

$$\frac{\partial \rho}{\partial t} + \nabla \cdot \mathbf{j} = 0 \qquad (1.2)$$

where $\rho$ is the time component of $j^\mu$ and $\mathbf{j}$ is the spatial part of this current.
If we integrate over a fixed volume V we find

$$\frac{\partial}{\partial t}\int_V \rho\, dV + (\text{flow of current normally out of the volume}) - (\text{flow of current normally into the volume}) = 0. \qquad (1.3)$$


This means that the rate of increase of charge in the volume is equal to the rate
of flow of charge into the volume minus the rate of flow out of the volume. This
is a very natural feature of a model in which the charges are carried on the
particles. Of course, this concept needs slight modification in the world of special
relativity, where there is apparent contraction of lengths and dilation of times
in different reference frames. Similarly, in quantum mechanics further modifications
are needed, which are yet further changed in quantum field theory.
But we are getting too far ahead of ourselves. Let us ask what symmetries have
to do with these conservation laws, as the title of this chapter suggests. There
is a theorem by E. Noether [1] to the effect that every continuous symmetry of
the action implies a corresponding conserved quantity.
It is not appropriate to prove this theorem at this stage, but it is very powerful
and extends to all types of description of the physics discussed earlier.
(Students note that Noether was a woman doing important work of this type
at a time when there were nowhere near as many women working in science.)

The point that is necessary to understand at this stage is that all conserved 
quantities in physics are linked to symmetries in this way. We shall meet 
examples of this later. The mathematics underlying this structure is that of 
group theory, both discrete groups and continuous groups as described by Lie. 
But for the moment we move on to simple examples in the next two chapters. 


Lagrangian and Hamiltonian Mechanics 


Although it has been made clear that the reader is expected to be competent 
in quantum field theory, an exception is made at this point to be sure that the 
readers really can cope. 

It is one of those curious quirks of history that long before quantum theory
was developed this version of classical mechanics established a framework 
that was capable of treating both fields and particles in both classical and 
quantum aspects. You are strongly urged to read Chapter 19 of Volume II of The 
Feynman Lectures on Physics [2] as an introduction to the deep and fascinating 
approach to physics in terms of the "principle of least action," if you have not 
met it previously. We shall approach the topic in a more pedestrian manner 
than Feynman, partly because I am not so brave a teacher and partly because 
I want to get you calculating for yourself as soon as possible. It is my firm 
belief that the best way to get on top of a subject like this is to lose your fear 
of it by getting your hands dirty and actually doing the real calculations in 
detail yourself. 

Suppose we have a one-dimensional system—yes, it is going to be the 
harmonic oscillator. We shall call the displacement from equilibrium q(t) 
rather than x(t) because later on we shall want "displacements of the fields"
at various points x and we do not wish to confuse the "displacements" with
the spatial positions. Then Newton's second law is replaced by the
Euler-Lagrange [3] equation

$$\frac{d}{dt}\frac{\partial L}{\partial \dot q} = \frac{\partial L}{\partial q} \qquad (1.4)$$
where $\dot q$ is the time derivative of q. The Lagrangian, L, is the difference
between the kinetic energy (T) and the potential energy (V), that is,

$$L(q, \dot q) = T - V \qquad (1.5)$$

and is to be regarded as a function of the independent variables q and $\dot q$ for
the purposes of partial differentiation. For the harmonic oscillator with mass
m and spring constant k we have

$$T = \frac{1}{2} m \dot q^2, \qquad V = \frac{1}{2} k q^2 = \frac{1}{2} m \omega^2 q^2 \qquad (1.6)$$

where $\omega^2 = k/m$. So that

$$L = \frac{1}{2} m \dot q^2 - \frac{1}{2} m \omega^2 q^2 \qquad (1.7)$$

and the Euler-Lagrange equation yields

$$\frac{d}{dt}(m \dot q) = -m \omega^2 q \qquad (1.8)$$

and we retrieve

$$\ddot q = -\omega^2 q \qquad (1.9)$$


as expected. 
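In the spirit of getting our hands dirty, here is a minimal numerical sketch that integrates Equation (1.9) and compares the result with the familiar cosine solution. The values of m and k, the initial conditions, and the choice of the velocity-Verlet scheme are illustrative assumptions of mine, not something taken from the text.

```python
import math

def integrate_oscillator(q0, v0, omega, t_end, steps):
    """Integrate q'' = -omega**2 * q with the velocity-Verlet scheme."""
    dt = t_end / steps
    q, v = q0, v0
    for _ in range(steps):
        a = -omega**2 * q                  # acceleration from Equation (1.9)
        q_new = q + v * dt + 0.5 * a * dt**2
        a_new = -omega**2 * q_new
        v += 0.5 * (a + a_new) * dt
        q = q_new
    return q, v

m, k = 2.0, 8.0                            # illustrative values (assumed)
omega = math.sqrt(k / m)                   # omega**2 = k/m
t_end = 3.0
q_num, v_num = integrate_oscillator(1.0, 0.0, omega, t_end, steps=30000)
q_exact = math.cos(omega * t_end)          # solution with q(0) = 1, q'(0) = 0
print(f"numerical q = {q_num:.6f}, exact q = {q_exact:.6f}")
```

The two numbers should agree closely, which is all that "as expected" means in practice.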

Now that we have a little experience with this formalism, we can take a look 
at the principle of least action. You will have noticed perhaps that the concept 
of force (which was primary in Newton's approach) has become secondary to 
the idea of potential. The least action principle makes the equation of motion
itself something that is derived from the minimization of the action

$$S = \int_{t_i}^{t_f} L(q, \dot q)\, dt \qquad (1.10)$$

where $t_i$ and $t_f$ are initial and final times. The principle postulates that the
actual path (often alternatively called trajectory) followed by the particle is
that which minimizes S. Imagine that, given L as an explicit function of q
and $\dot q$, you evaluate S for a few paths. These are just fictitious paths and none
of them is likely to be the Newtonian one. I have drawn the three from the
problem on the q-t diagram in Figure 1.1.

These must start and finish at the same places and times. According to the
principle, only if one of these coincides with the Newtonian path will the
value of S be the minimum possible. You need a calculus approach to get a
general answer. Notice, however, S is a function of the function q(t). We say
it is a "functional" of q(t). We need to find the particular function, $q_0(t)$, that
minimizes S.

Suppose there is a small variation $\delta q(t)$ in a path q(t) from $q(t_i)$ to $q(t_f)$.
When $q(t) = q_0(t)$, the variation $\delta S$ caused by this change $\delta q$ must vanish.

FIGURE 1.1
q-t diagram.

Now we can work out the change of action for any path as

$$\begin{aligned}
\delta S &= \int_{t_i}^{t_f} \left( \frac{\partial L}{\partial q}\,\delta q + \frac{\partial L}{\partial \dot q}\,\delta \dot q \right) dt \\
&= \int_{t_i}^{t_f} \left[ \frac{\partial L}{\partial q}\,\delta q - \frac{d}{dt}\left(\frac{\partial L}{\partial \dot q}\right)\delta q + \frac{d}{dt}\left(\frac{\partial L}{\partial \dot q}\,\delta q\right) \right] dt \\
&= \int_{t_i}^{t_f} \left[ \frac{\partial L}{\partial q} - \frac{d}{dt}\frac{\partial L}{\partial \dot q} \right] \delta q\, dt + \left[ \frac{\partial L}{\partial \dot q}\,\delta q \right]_{t_i}^{t_f},
\end{aligned}$$

where we used $\delta \dot q = \frac{d}{dt}\delta q$ in the second step. But we are considering paths
with fixed end points, so that $\delta q(t_i) = 0 = \delta q(t_f)$ for any variation, and the
final term vanishes. Hence, since $\delta S$ must vanish for arbitrary $\delta q$, we need

$$\frac{d}{dt}\frac{\partial L}{\partial \dot q} = \frac{\partial L}{\partial q},$$

which retrieves the Euler-Lagrange equation of motion. The solution of this
is the $q_0(t)$, which gives the path actually followed by the particle.
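To see the principle at work with numbers, the following sketch evaluates a discretized action for the harmonic oscillator along the Newtonian path and along a few fictitious paths with the same end points. Everything specific here is an assumption made for illustration: m = ω = 1, the time interval from 0 to T = 1, the end points q(0) = 0 and q(T) = 1, and the sinusoidal shape of the variation. The interval is deliberately shorter than half a period, where the stationary path of the oscillator is a genuine minimum.

```python
import math

def action(path, t_i, t_f, m=1.0, omega=1.0, steps=2000):
    """Discretized S = integral of (m q'^2 / 2 - m omega^2 q^2 / 2) dt."""
    dt = (t_f - t_i) / steps
    total = 0.0
    for n in range(steps):
        t0 = t_i + n * dt
        q0, q1 = path(t0), path(t0 + dt)
        q_dot = (q1 - q0) / dt             # velocity on this small interval
        q_mid = 0.5 * (q0 + q1)            # position at the interval midpoint
        total += (0.5 * m * q_dot**2 - 0.5 * m * omega**2 * q_mid**2) * dt
    return total

T = 1.0                                    # assumed duration, less than pi / omega

def classical(t):
    """Newtonian path with q(0) = 0 and q(T) = 1 for m = omega = 1."""
    return math.sin(t) / math.sin(T)

def varied(eps):
    """Fictitious path: the variation vanishes at both end points."""
    return lambda t: classical(t) + eps * math.sin(math.pi * t / T)

actions = {eps: action(varied(eps), 0.0, T) for eps in (-0.3, -0.1, 0.0, 0.1, 0.3)}
for eps, s in sorted(actions.items()):
    print(f"eps = {eps:+.1f}  S = {s:.6f}")
```

The entry with eps = 0 should be the smallest, and the excess for the other paths should grow like the square of eps, as one expects when the first-order variation vanishes.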

As we shall see later, this formalism is well suited to treat systems of the 
many (indeed infinitely many) linked dynamical variables found in field 
theories. But the transition from classical to quantum mechanics is made more 
transparent by considering the Hamiltonian formulation. The idea, in the first 
place, is to find a change in variables (from q and $\dot q$) which will replace the
second order Euler-Lagrange equation by two linked first order equations.
This piece of magic is performed by introducing

$$p = \frac{\partial L}{\partial \dot q} \qquad (1.11)$$

as a "generalized momentum conjugate to the generalized coordinate q."
(When q is a Cartesian coordinate, p will frequently be the usual linear
momentum, as we shall see.) Then the Hamiltonian is introduced by the
Legendre transformation

$$H(q, p) = p \dot q - L(q, \dot q) \qquad (1.12)$$

and the Euler-Lagrange equation is replaced by the pair of equations

$$\dot q = \frac{\partial H}{\partial p} \qquad (1.13)$$

$$\dot p = -\frac{\partial H}{\partial q} \qquad (1.14)$$


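which are known as Hamilton's canonical equations. To get a feel for this
formulation we return to our old friend the harmonic oscillator. From Equation
(1.7) we see that

$$\begin{aligned}
p &= \frac{\partial L}{\partial \dot q} = m \dot q, \\
H &= \frac{p^2}{m} - \frac{p^2}{2m} + \frac{m \omega^2 q^2}{2} = \frac{p^2}{2m} + \frac{m \omega^2 q^2}{2}
\end{aligned} \qquad (1.15)$$

is the form of the Hamiltonian in the new variables. Notice that the Hamiltonian
is the total energy, T + V. This is a very general feature, and provided
that time does not appear explicitly then

$$\frac{dH}{dt} = \frac{\partial H}{\partial q}\dot q + \frac{\partial H}{\partial p}\dot p = \frac{\partial H}{\partial q}\frac{\partial H}{\partial p} - \frac{\partial H}{\partial p}\frac{\partial H}{\partial q} = 0, \qquad (1.16)$$

which reflects energy conservation. In the present case the equations of motion,
Equations (1.13) and (1.14), yield

$$\dot q = \frac{p}{m} \qquad (1.17)$$

$$\dot p = -m \omega^2 q \qquad (1.18)$$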


when Equation (1.15) is used directly. The first of these reconfirms the definition
of the momentum, and on substitution into the second retrieves Equation
(1.9) as the second order equation of motion. It turns out, however, to be
instructive to solve the first order Equations (1.17) and (1.18) directly. Consider
the linear combination

$$A = \frac{1}{\sqrt{2 m \omega}}\,(m \omega q + i p), \qquad (1.19)$$


6 Group Theory for the Standard Model of Particle Physics and Beyond 


which is so designed that

$$\dot A = -i \omega A \qquad (1.20)$$

with

$$A = a e^{-i \omega t} \qquad (1.21)$$

as the obvious solution, where a is constant. Taking the complex conjugate of
Equation (1.19), we immediately find

$$q = \frac{1}{\sqrt{2 m \omega}}\,(A + A^*) = \frac{1}{\sqrt{2 m \omega}}\,(a e^{-i \omega t} + a^* e^{i \omega t}), \qquad (1.22)$$

which is equivalent to the previous solution.
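A short numerical sketch may help to make the first order formulation concrete. It steps Hamilton's equations (1.17) and (1.18) forward in time, checks that the Hamiltonian stays (very nearly) constant, and confirms that the complex combination A of Equation (1.19), taken here as (mωq + ip)/√(2mω), simply rotates in phase as in Equation (1.21). The parameter values, the initial conditions, and the leapfrog integrator are my own assumptions for the purpose of illustration.

```python
import cmath
import math

m, omega = 1.5, 2.0                        # illustrative values (assumed)

def hamiltonian(q, p):
    """H = p^2 / 2m + m omega^2 q^2 / 2, as in Equation (1.15)."""
    return p**2 / (2 * m) + 0.5 * m * omega**2 * q**2

def step(q, p, dt):
    """One leapfrog step of q' = p/m, p' = -m omega^2 q."""
    p -= 0.5 * dt * m * omega**2 * q       # half kick
    q += dt * p / m                        # drift
    p -= 0.5 * dt * m * omega**2 * q       # half kick
    return q, p

def amplitude(q, p):
    """Complex combination A of Equation (1.19)."""
    return (m * omega * q + 1j * p) / math.sqrt(2 * m * omega)

q, p = 0.7, -0.4                           # assumed initial conditions
A0, E0 = amplitude(q, p), hamiltonian(q, p)
t_end, steps = 2.0, 20000
dt = t_end / steps
for _ in range(steps):
    q, p = step(q, p, dt)

A_num = amplitude(q, p)
A_exact = A0 * cmath.exp(-1j * omega * t_end)   # Equation (1.21)
print(abs(A_num - A_exact), abs(hamiltonian(q, p) - E0))
```

Both printed numbers should be tiny: the first checks the solution of Equation (1.20), the second the energy conservation expressed by Equation (1.16).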


Quantum Mechanics 


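The passage to quantum mechanics in this formalism is facilitated by introducing
the Poisson bracket notation. The Poisson bracket of any two functions
f and g, of q and p, is simply

$$\{f, g\} = \frac{\partial f}{\partial q}\frac{\partial g}{\partial p} - \frac{\partial f}{\partial p}\frac{\partial g}{\partial q} \qquad (1.23)$$

and we see that

$$\{q, H\} = \dot q \qquad (1.24)$$

$$\{p, H\} = \dot p \qquad (1.25)$$

are alternative ways of writing Equations (1.13) and (1.14), the equations of
motion. Moreover, if F is any function of q and p, then

$$\frac{dF}{dt} = \frac{\partial F}{\partial q}\dot q + \frac{\partial F}{\partial p}\dot p = \frac{\partial F}{\partial q}\frac{\partial H}{\partial p} - \frac{\partial F}{\partial p}\frac{\partial H}{\partial q} = \{F, H\} \qquad (1.26)$$

while

$$\{q, q\} = 0, \qquad \{p, p\} = 0, \qquad \{q, p\} = 1 \qquad (1.27)$$

follow directly from the definition (Equation (1.23)) of the Poisson bracket.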
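If you would like to check these brackets without doing the algebra, the definition can be evaluated numerically. The sketch below approximates the partial derivatives in the Poisson bracket by central differences and applies it to the harmonic oscillator Hamiltonian. The helper name `poisson_bracket`, the step size h, and the sample point are assumptions chosen only for illustration.

```python
def poisson_bracket(f, g, q, p, h=1e-5):
    """Approximate {f, g} = f_q g_p - f_p g_q with central differences."""
    df_dq = (f(q + h, p) - f(q - h, p)) / (2 * h)
    df_dp = (f(q, p + h) - f(q, p - h)) / (2 * h)
    dg_dq = (g(q + h, p) - g(q - h, p)) / (2 * h)
    dg_dp = (g(q, p + h) - g(q, p - h)) / (2 * h)
    return df_dq * dg_dp - df_dp * dg_dq

m, omega = 1.5, 2.0                        # illustrative values (assumed)

def H(q, p):
    return p**2 / (2 * m) + 0.5 * m * omega**2 * q**2

def Q(q, p):
    return q

def P(q, p):
    return p

q0, p0 = 0.3, -1.2                         # arbitrary phase-space point
qp = poisson_bracket(Q, P, q0, p0)         # should be close to 1
qH = poisson_bracket(Q, H, q0, p0)         # should be close to p0 / m
pH = poisson_bracket(P, H, q0, p0)         # should be close to -m omega^2 q0
print(qp, qH, pH)
```

The last two values reproduce the right-hand sides of Hamilton's equations for the oscillator at the chosen point.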
The transition to quantum mechanics is now effected by the correspondence
$\{\alpha, \beta\} \to -i[\hat\alpha, \hat\beta] = -i(\hat\alpha\hat\beta - \hat\beta\hat\alpha)$ between the classical dynamical variables
and their hatted quantum mechanical operator counterparts. (We use
"natural" units with $\hbar = 1$.) In particular, Equation (1.27) yields

$$[\hat q(t), \hat p(t)] = i \qquad (1.28)$$

expressing the Heisenberg uncertainty principle [5], and Equation (1.26) gives

$$\frac{d\hat F(t)}{dt} = -i[\hat F(t), \hat H] \qquad (1.29)$$

as the Heisenberg equation of motion. The time dependence has been exhibited
to draw the reader's attention to the fact that this is quantum mechanics
expressed in the Heisenberg picture [6], where states are time independent
but the dynamical variables contain the time dependence.

The alternative Schrödinger picture, in which the variables are time independent,
has the time dependence of state vectors given by the Schrödinger
equation

$$i\frac{\partial}{\partial t}|\psi(t)\rangle = \hat H|\psi(t)\rangle \qquad (1.30)$$

with the formal solution

$$|\psi(t)\rangle = e^{-i\hat H t}|\psi\rangle \qquad (1.31)$$

where we have identified the Schrödinger state at time zero, $|\psi(0)\rangle$,
with $|\psi\rangle$, the time independent Heisenberg state. Of course, Equation (1.31)
is just a unitary transformation between the two pictures, with

$$\hat F(t) = e^{i\hat H t}\,\hat F\,e^{-i\hat H t} \qquad (1.32)$$

as the corresponding transformation between operators. The important
feature of this is that

$$[\hat q, \hat p] = i \qquad (1.33)$$

follows immediately from Equation (1.28) as an expression of the uncertainty
principle in the Schrödinger picture. In quantum field theory we shall find
the Heisenberg picture very convenient.

In the quantum case we have the operator version of Equation (1.15)

$$\hat H = \frac{\hat p^2}{2m} + \frac{m \omega^2 \hat q^2}{2} \qquad (1.34)$$

$$\phantom{\hat H} = \frac{\hat p^2(t)}{2m} + \frac{m \omega^2 \hat q^2(t)}{2} \qquad (1.35)$$

with Equation (1.32) giving trivially the equality of these alternate forms.
From the Heisenberg equation of motion (Equation (1.29)) we can easily see
that

$$\dot{\hat q}(t) = \frac{\hat p(t)}{m} \qquad (1.36)$$

$$\dot{\hat p}(t) = -m \omega^2 \hat q(t) \qquad (1.37)$$

so that we get

$$\ddot{\hat q}(t) = -\omega^2 \hat q(t) \qquad (1.38)$$


by combining these. Now, please notice that this is not just the classical equa- 
tion of motion (Equation (1.9)) again. What Equation (1.38) tells us is the 
behavior of the operator with time, not where the particle can be found. If 
we take the expectation value of Equation (1.38) between (time independent) 
Heisenberg states, then we learn that the mean position of the particle does 
follow the classical path. This is very reassuring, but there will be quantum 
fluctuations about the classical path, of course. 


The Oscillator Spectrum: Creation and Annihilation Operators 


This subtopic is of such central importance later that it deserves a section 
all to itself. You have no doubt all been exposed to this material before, but 
I want to stress the operator treatment that we shall see again in our field 
theory. (If you already know this method, it will at least serve as a review and 
to establish notation.) 

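We seek a set of states $|E_n\rangle$, n = 0, 1, ..., to serve as a complete basis in
which to expand any general state, and thus must solve the time independent
Schrödinger equation

$$\hat H|E_n\rangle = E_n|E_n\rangle \qquad (1.39)$$

for the eigenvalues and eigenvectors. The Hamiltonian is given in Equation
(1.34) as

$$\hat H = \frac{\hat p^2}{2m} + \frac{m \omega^2 \hat q^2}{2}$$

but our classical treatment (Equation (1.19)) suggests

$$\hat a = \frac{1}{\sqrt{2 m \omega}}\,(m \omega \hat q + i \hat p) \qquad (1.40)$$

$$\hat a^\dagger = \frac{1}{\sqrt{2 m \omega}}\,(m \omega \hat q - i \hat p) \qquad (1.41)$$

as preferable dynamical variables. It is straightforward to see that

$$\hat a^\dagger \hat a = \frac{m \omega}{2}\hat q^2 + \frac{1}{2 m \omega}\hat p^2 + \frac{i}{2}[\hat q, \hat p] = \frac{1}{\omega}\left(\frac{\hat p^2}{2m} + \frac{m \omega^2 \hat q^2}{2}\right) - \frac{1}{2}$$

where Equation (1.33) is used in the last term. Hence we have

$$\hat H = \omega\left(\hat a^\dagger \hat a + \frac{1}{2}\right) \qquad (1.42)$$

$$\hat H = \omega\left(\hat a \hat a^\dagger - \frac{1}{2}\right) \qquad (1.43)$$

so that

$$\hat H = \frac{\omega}{2}\,(\hat a^\dagger \hat a + \hat a \hat a^\dagger) \qquad (1.44)$$

$$[\hat a, \hat a^\dagger] = 1 \qquad (1.45)$$

follow by adding and subtracting. Notice that (from Equation (1.42)),

$$[\hat H, \hat a^\dagger] = \omega\,\hat a^\dagger [\hat a, \hat a^\dagger]$$

so that

$$[\hat H, \hat a^\dagger] = \omega \hat a^\dagger \qquad (1.46)$$

$$[\hat H, \hat a] = -\omega \hat a. \qquad (1.47)$$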


(In Equations (1.45)-(1.47) we now have the algebraic information in a suitable 
form to find the spectrum. I urge you to do Problem 1.14 before continuing.) 
We are now in a position to see exactly why $\hat a$ and $\hat a^\dagger$ are so important. They
have the magical property that they take you from one energy eigenstate
into another, rather than into some arbitrary linear combination of states. To
see this, consider the effect of the Hamiltonian on an eigenstate that has been
changed by the action of $\hat a^\dagger$:

$$\begin{aligned}
\hat H \hat a^\dagger |E_n\rangle &= ([\hat H, \hat a^\dagger] + \hat a^\dagger \hat H)|E_n\rangle \\
&= (\omega \hat a^\dagger + \hat a^\dagger E_n)|E_n\rangle \\
&= (E_n + \omega)\,\hat a^\dagger |E_n\rangle
\end{aligned} \qquad (1.48)$$

so we see that $\hat a^\dagger|E_n\rangle$ is indeed an eigenstate of $\hat H$ and $(E_n + \omega)$ is the
eigenvalue. In a similar way we can establish that $\hat a|E_n\rangle$ is an eigenstate
with $(E_n - \omega)$ as the eigenvalue this time. Of course, you cannot lower the
energy until it becomes negative, so there must be a ground state of lowest
energy $E_0$ with

$$\hat a|E_0\rangle = 0 \qquad (1.49)$$

as its definition to maintain consistency. (Beware! In relativistic physics such
reasoning will not be true.) But here you can prove it. From Equation (1.42)
we see that

$$\hat H|E_0\rangle = \left(0 + \frac{1}{2}\omega\right)|E_0\rangle$$

establishing $E_0 = \frac{1}{2}\omega$ as the ground state (or zero point) energy. Then, by
raising, we see that the energy spectrum is

$$E_n = \left(n + \frac{1}{2}\right)\omega, \qquad n = 0, 1, \ldots \qquad (1.50)$$

and the corresponding eigenstates are given by

$$|E_n\rangle = \frac{1}{\sqrt{n!}}\,(\hat a^\dagger)^n |E_0\rangle \qquad (1.51)$$

where the exact factor follows from the requirement

$$\langle E_n|E_n\rangle = 1 \qquad (1.52)$$

of normalization. It is now natural to speak of a vacuum rather than a ground
state, and then to envisage the "creation of particles" (or "excitation of quanta")
into that vacuum. Indeed if we define a number operator

$$\hat N = \hat a^\dagger \hat a \qquad (1.53)$$

to conform to our notation in Equations (1.42) and (1.50), then the change of
notation to

$$\hat N|n\rangle = n|n\rangle \qquad (1.54)$$

$$\hat H|n\rangle = E_n|n\rangle = \left(n + \frac{1}{2}\right)\omega|n\rangle \qquad (1.55)$$


becomes irresistible. 
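It can be reassuring to see this algebra realized with explicit matrices. The sketch below builds the annihilation operator in a number basis truncated to a few states, using the standard matrix element a|n> = sqrt(n)|n-1> (which follows from the normalization in Equation (1.51) but is not written out above), and then forms the number operator and the Hamiltonian of Equation (1.42). The truncation dimension and the value of ω are arbitrary assumptions; the truncation also spoils the commutator of Equation (1.45) in the last diagonal entry, which is an artifact of cutting the basis off and not a feature of the oscillator.

```python
import math

def matmul(A, B):
    """Plain matrix product for small square matrices stored as lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def ladder_matrices(dim):
    """Annihilation and creation operators in a truncated number basis."""
    a = [[0.0] * dim for _ in range(dim)]
    for n in range(1, dim):
        a[n - 1][n] = math.sqrt(n)         # a|n> = sqrt(n)|n-1>
    a_dag = [[a[j][i] for j in range(dim)] for i in range(dim)]
    return a, a_dag

dim, omega = 6, 1.3                        # illustrative values (assumed)
a, a_dag = ladder_matrices(dim)
number = matmul(a_dag, a)                  # N = a^dagger a, Equation (1.53)
energies = [omega * (number[n][n] + 0.5) for n in range(dim)]   # Equation (1.42)

aa_dag = matmul(a, a_dag)
commutator = [[aa_dag[i][j] - number[i][j] for j in range(dim)]
              for i in range(dim)]         # [a, a^dagger]

print(energies)
print([commutator[i][i] for i in range(dim)])
```

The energies come out as (n + 1/2)ω, and the commutator is the unit matrix apart from the final diagonal entry produced by the truncation.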


Coupled Oscillators: Normal Modes 


Before we launch into an attack on the quantum field theory of infinitely many
degrees of freedom, it is probably sensible to try a finite number of variables. 
Let's start with the classical theory of two equal masses in a one-dimensional 
space (e.g., in a straight slot on a horizontal table) tied together by a spring of 
spring constant g, and tied to fixed points by springs of spring constant k. 
FIGURE 1.2
Three spring forces.

I have in mind the picture in Figure 1.2, where $q_1$ and $q_2$ are displacements
from equilibrium, and the Lagrangian takes the form

$$L = \frac{1}{2} m (\dot q_1^2 + \dot q_2^2) - \frac{1}{2} k (q_1^2 + q_2^2) - \frac{1}{2} g (q_2 - q_1)^2 \qquad (1.56)$$

if none of the springs are stretched or compressed in the equilibrium position.
You can think of this as a model of a (very) small solid. One advantage of the
Lagrangian approach is that we never have to introduce the forces in the
springs and then eliminate them again; constraints are handled very neatly
in this formalism. The Euler-Lagrange equations yield

$$\ddot q_1 = -\frac{k}{m} q_1 + \frac{g}{m}(q_2 - q_1) \qquad (1.57)$$

$$\ddot q_2 = -\frac{k}{m} q_2 - \frac{g}{m}(q_2 - q_1), \qquad (1.58)$$


which are sufficiently simple that we do not need formal methods to solve
them. We spot the relevant combinations of variables by adding and subtracting
to obtain

$$(\ddot q_1 + \ddot q_2) = -\frac{k}{m}(q_1 + q_2) \qquad (1.59)$$

$$(\ddot q_1 - \ddot q_2) = -\left(\frac{k}{m} + \frac{2g}{m}\right)(q_1 - q_2), \qquad (1.60)$$

which we recognize as uncoupled simple harmonic oscillators. The solutions
are then obvious. We have one normal mode of oscillation with frequency

$$\omega_1 = \sqrt{\frac{k}{m}} \qquad (1.61)$$

and Equation (1.60) is satisfied trivially by having the two displacements
equal. The second normal mode has frequency

$$\omega_2 = \sqrt{\frac{k + 2g}{m}} \qquad (1.62)$$

and Equation (1.59) is satisfied trivially by the two displacements being equal
but opposite in sense. The general solution is then obtained by superposition
(because the equations are linear) and we have

$$q_1 = A\cos(\omega_1 t + \delta) + B\sin(\omega_2 t + \epsilon) \qquad (1.63)$$

$$q_2 = A\cos(\omega_1 t + \delta) - B\sin(\omega_2 t + \epsilon) \qquad (1.64)$$

where A, B, $\delta$, and $\epsilon$ are constants to be fixed by initial conditions.

Notice that the frequency of the lowest mode is independent of g, naturally,
because if $q_2 = q_1$, then the middle spring is not stretched. Indeed, if k = 0
this mode is of zero frequency. You can then think of a two atom molecule that 
is free, and this mode corresponds to the free motion of the center of mass. 
This is not really important for these lectures, but when you hear theorists 
worrying about zero modes and symmetries you will have some idea of their 
problems. Zero modes can be a real pain for theorists as they usually need 
separate treatment, and the associated symmetry (here just translation) is not 
always easy to find. 
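For readers who like to check such things by machine, here is a small sketch of the normal mode analysis. It reads off the two frequencies from the symmetric 2x2 matrix D in the equations of motion written as q'' = -Dq, and then verifies by finite differences that the general solution, Equations (1.63) and (1.64), really satisfies Equations (1.57) and (1.58). The numerical values of m, k, g and of the constants A, B, δ, ε are assumptions picked only for illustration.

```python
import math

def normal_frequencies(m, k, g):
    """Normal mode frequencies of the two coupled oscillators.

    The equations of motion read q'' = -D q with
    D = [[(k+g)/m, -g/m], [-g/m, (k+g)/m]], whose eigenvalues are
    k/m (eigenvector (1, 1)) and (k+2g)/m (eigenvector (1, -1)).
    """
    d_diag = (k + g) / m
    d_off = -g / m
    return math.sqrt(d_diag + d_off), math.sqrt(d_diag - d_off)

m, k, g = 1.0, 4.0, 2.5                    # illustrative values (assumed)
w1, w2 = normal_frequencies(m, k, g)

A, B, delta, eps = 0.8, 0.3, 0.4, -1.1     # arbitrary constants (assumed)

def q1(t):
    return A * math.cos(w1 * t + delta) + B * math.sin(w2 * t + eps)

def q2(t):
    return A * math.cos(w1 * t + delta) - B * math.sin(w2 * t + eps)

def second_derivative(f, t, h=1e-3):
    """Central difference approximation to f''(t)."""
    return (f(t + h) - 2 * f(t) + f(t - h)) / h**2

t = 0.37
res1 = second_derivative(q1, t) - (-(k / m) * q1(t) + (g / m) * (q2(t) - q1(t)))
res2 = second_derivative(q2, t) - (-(k / m) * q2(t) - (g / m) * (q2(t) - q1(t)))
print(w1, w2, res1, res2)
```

Setting k = 0 in `normal_frequencies` gives a vanishing first frequency, which is the zero mode discussed above.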

Now, how do we quantize a system like this? The key lies in the observation 
made earlier that we are just dealing with uncoupled harmonic oscillators in 
terms of 


$$Q_1 = \frac{1}{\sqrt{2}}(q_1 + q_2) \quad \text{and} \quad Q_2 = \frac{1}{\sqrt{2}}(q_1 - q_2) \qquad (1.65)$$

as the variables, where the normalization factor is for convenience. It is now
easy to work out the form

$$H = \frac{1}{2m}(P_1^2 + P_2^2) + \frac{k}{2} Q_1^2 + \left(\frac{k}{2} + g\right) Q_2^2 \qquad (1.66)$$

for the Hamiltonian. So to quantize we simply have to put hats on the Q's
and P's, and do the harmonic oscillator problems twice. You should check, of
course, that the commutation relations are

$$[\hat Q_i, \hat Q_j] = 0, \qquad [\hat Q_i, \hat P_j] = i\delta_{ij}, \qquad [\hat P_i, \hat P_j] = 0 \qquad (1.67)$$

where i, j = 1, 2, as you expect in terms of the new variables. Then there is a
vacuum state, two number operators $\hat N_1$ and $\hat N_2$ (one for each of the $\hat Q_1$ and $\hat Q_2$
systems), and states $|n_1, n_2\rangle$ with $(n_1 + \frac{1}{2})\omega_1 + (n_2 + \frac{1}{2})\omega_2$ as the energies.
Now it is probably fairly clear that this idea generalizes to the N variables
case. The eigenstates of the Hamiltonian are denoted by $|n_1, n_2 \ldots n_N\rangle$, and
are associated with energy eigenvalues $E_{n_1 n_2 \ldots n_N} = \sum_{r=1}^{N} (n_r + \frac{1}{2})\omega_r$, with
the various $\omega_r$ depending upon the details of masses and spring constants.
Notice that the emphasis has now changed completely from "displacements
of atoms" to "excitations in the solid." It is particularly important to note
that there is no restriction (except the total energy available) on the number
of excitations, even though the number of underlying "displacements" is
still quite finite at this stage. This approach is central to many body physics
where one speaks of "phonons" as the vibrational excitations (similarly plasmons
for plasma oscillations, and magnons for magnetic oscillations). What
we are leading toward is a framework in which all elementary particles
(quarks, leptons, $W^\pm$, $Z^0$, photons, gluons) are excitations of underlying
fields. However, we must first learn to handle a simple classical continuous
system.


One-Dimensional Fields: Waves 


What is to be our generalization of the Lagrangian in Equation (1.56) when
there is a continuum of "atoms" rather than just two? The sum over two
sites becomes an integral over the position x, and presumably the last term
becomes proportional to a spatial derivative. We shall have to absorb dimensions
into the constants, of course, and we use $\phi(x, t)$ rather than q(x, t) for
future convenience, so that we write

$$L = \int \left[ \frac{\rho}{2}\left(\frac{\partial \phi}{\partial t}\right)^2 - \frac{\rho}{2}\mu^2 \phi^2 - \frac{\rho}{2} c^2 \left(\frac{\partial \phi}{\partial x}\right)^2 \right] dx \qquad (1.68)$$

where $\rho$ is a mass density, and $\mu$ and c are just convenient names for the
modified constants. Now what is the generalization of the Euler-Lagrange
equations? In this continuous case we define a Lagrangian density $\mathcal{L}$ by

$$L = \int \mathcal{L}\, dx \qquad (1.69)$$

so that

$$S = \int L\, dt = \int \mathcal{L}\, dt\, dx \qquad (1.70)$$


is the action to be minimized. (I am being deliberately vague about the limits 
of the integrals. You may think of a solid between x = 0 and x = L, or of a 
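field extending over all space.) The new feature in the continuous case is that
$\mathcal{L}$ depends not only on $\phi$ and $\dot\phi = \partial\phi/\partial t$ but also on $\partial\phi/\partial x$, and all these must be
varied. Thus

$$\delta S = \int \left[ \frac{\partial \mathcal{L}}{\partial \phi}\,\delta\phi + \frac{\partial \mathcal{L}}{\partial(\partial\phi/\partial t)}\,\delta\!\left(\frac{\partial \phi}{\partial t}\right) + \frac{\partial \mathcal{L}}{\partial(\partial\phi/\partial x)}\,\delta\!\left(\frac{\partial \phi}{\partial x}\right) \right] dt\, dx \qquad (1.71)$$

and we integrate by parts in t for the middle term, and by parts in x for the
final term, to get

$$\delta S = \int \left[ \frac{\partial \mathcal{L}}{\partial \phi} - \frac{\partial}{\partial t}\left(\frac{\partial \mathcal{L}}{\partial(\partial\phi/\partial t)}\right) - \frac{\partial}{\partial x}\left(\frac{\partial \mathcal{L}}{\partial(\partial\phi/\partial x)}\right) \right] \delta\phi\, dt\, dx \qquad (1.72)$$

when the end-point (or boundary) terms are assumed to vanish. Since $\delta\phi$ is
arbitrary and $\delta S$ must vanish, we get the Euler-Lagrange field equations

$$\frac{\partial \mathcal{L}}{\partial \phi} - \frac{\partial}{\partial t}\left(\frac{\partial \mathcal{L}}{\partial(\partial\phi/\partial t)}\right) - \frac{\partial}{\partial x}\left(\frac{\partial \mathcal{L}}{\partial(\partial\phi/\partial x)}\right) = 0 \qquad (1.73)$$

with

$$\frac{\partial \mathcal{L}}{\partial \phi} - \frac{\partial}{\partial t}\left(\frac{\partial \mathcal{L}}{\partial(\partial\phi/\partial t)}\right) - \nabla \cdot \left(\frac{\partial \mathcal{L}}{\partial(\nabla\phi)}\right) = 0 \qquad (1.74)$$

as the generalization to three spatial dimensions.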
Putting the expression for $\mathcal{L}$ implicit in Equation (1.68) into Equation (1.73)
we get

$$0 = \frac{\partial^2 \phi}{\partial t^2} - c^2 \frac{\partial^2 \phi}{\partial x^2} + \mu^2 \phi \qquad (1.75)$$

as the field equation. Now for some good news and some bad news. The
good news is that (not entirely by accident) this happens to be a relativistic
equation suitable for discussing spinless particles (like the Higgs boson), so
our warm-up exercises are already covering relevant material. The bad news
is that there are a few snags in its interpretation in relativistic physics. For
the moment we merely have to notice that we already know a lot about this
equation. If we ignore the $\mu^2$ term, we have

$$\frac{\partial^2 \phi}{\partial t^2} = c^2 \frac{\partial^2 \phi}{\partial x^2} \qquad (1.76)$$

which is the familiar wave equation (from electromagnetism, e.g., where $\phi$
would be a component of the E or B field) with general solution

$$\phi = f(x - ct) + g(x + ct) \qquad (1.77)$$


where f and g are arbitrary functions representing, respectively, waves trav- 
eling to the right and left with velocity c, which we shall now set equal to one 
in "natural" units. Moreover, we are familiar with the idea of superimposing 
sinusoidal solutions to produce standing waves, when discrete frequencies 
arise from the boundary conditions, as in the case of sound waves in a tube 
or transverse waves on a violin string. Then a general solution can be written 
as a Fourier series of these sinusoidal ones. 
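The claim that any f(x − ct) + g(x + ct) solves the wave equation is easy to test numerically. In the sketch below the particular shapes chosen for f and g (a Gaussian pulse and a sine wave), the value of c, and the finite-difference step are all assumptions of mine; the point is only that the residual of Equation (1.76) vanishes to within the accuracy of the differencing.

```python
import math

c = 1.5                                    # wave speed, illustrative value

def f(u):
    """Right-moving profile (assumed shape: a Gaussian pulse)."""
    return math.exp(-u**2)

def g(u):
    """Left-moving profile (assumed shape: a sine wave)."""
    return math.sin(u)

def phi(x, t):
    """General solution phi = f(x - ct) + g(x + ct), Equation (1.77)."""
    return f(x - c * t) + g(x + c * t)

def wave_residual(x, t, h=1e-3):
    """phi_tt - c^2 phi_xx, both estimated by central differences."""
    phi_tt = (phi(x, t + h) - 2 * phi(x, t) + phi(x, t - h)) / h**2
    phi_xx = (phi(x + h, t) - 2 * phi(x, t) + phi(x - h, t)) / h**2
    return phi_tt - c**2 * phi_xx

residuals = [abs(wave_residual(x, t))
             for x in (-1.0, 0.2, 0.9) for t in (0.0, 0.5)]
print(max(residuals))
```

Swapping in other smooth functions for f and g should leave the residual just as small.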

But this is already enough hint to see what we need to do for the full equation,
Equation (1.75): the sinusoidal functions will still work, but the frequencies
will be $\mu^2$ dependent. Assume for simplicity that the end "atoms" in our linear
chain are constrained to be at rest at x = 0 and x = L. Then a suitable trial
solution ensuring this is

$$\phi(x, t) = A_r(t) \sin\left(\frac{r \pi x}{L}\right) \qquad (1.78)$$

provided that r is an integer. Substituting into Equation (1.75) gives

$$\ddot A_r = -\omega_r^2 A_r \qquad (1.79)$$

$$\omega_r^2 = \mu^2 + \frac{r^2 \pi^2 c^2}{L^2} \qquad (1.80)$$

and we see that we are back to the simple harmonic oscillator again. Of course
a general solution, by superposition, may be written as

$$\phi(x, t) = \sum_{r=1}^{\infty} A_r(t) \sin\left(\frac{r \pi x}{L}\right) \qquad (1.81)$$


with each $A_r(t)$ being associated with its own characteristic frequency $\omega_r$ as
given earlier. So what we have discovered is that the Fourier series gives the
mode analysis. The sinusoidal functions in x give exactly the correct "linear
combinations of coordinates" to pick out the separate frequencies; the
Fourier amplitudes, $A_r$, act exactly as do normal coordinates. To confirm this
view we can construct the Hamiltonian for this system, and see if the solution
(Equation (1.81)) reveals a sum of uncoupled oscillators. It is clear in our
expression for the Lagrangian (Equation (1.68)) that the first term is a kinetic
term and the remainder is potential, so that

$$H = \int \left[ \frac{\rho}{2}\left(\frac{\partial \phi}{\partial t}\right)^2 + \frac{\rho}{2}\mu^2 \phi^2 + \frac{\rho}{2} c^2 \left(\frac{\partial \phi}{\partial x}\right)^2 \right] dx \qquad (1.82)$$

is the form of the Hamiltonian. Substituting Equation (1.81) and using the
orthonormality of the sine functions in the region x = 0 to x = L reveals that

$$H = \frac{\rho L}{4} \sum_{r=1}^{\infty} \left( \dot A_r^2 + \omega_r^2 A_r^2 \right) \qquad (1.83)$$


confirming the view we had formed. The quantum version of this problem
is now obvious: this is exactly as previously shown except that there are
an infinite number of oscillators, and therefore an infinite set of the $n_r$ to be
specified as occupation numbers to define the state. This does raise, however,
the question of the zero-point energy problem. We have now introduced an
infinite number of oscillators each with minimum energy $\frac{1}{2}\omega_r$, so that the total
energy of our "continuous chain of atoms" is infinite. The conventional view is
that this is not a real problem, only the energy differences really matter. (After
all, there is no concept of destroying this "crystal" into an infinite number of
parts and trying to extract the energy.) So you subtract the infinite number of
$\frac{1}{2}\omega_r$ contributions and take a new reference point for zero energy.
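A few lines of code make the zero-point problem, and its conventional resolution, quite tangible. The sketch below uses the mode frequencies of Equation (1.80) and sums the zero-point energies up to a cutoff on the mode number r: the sum grows without limit as the cutoff is raised, whereas the energy of an excited state measured from the vacuum does not depend on the cutoff at all. The choices μ = 0, c = 1, and a chain of length π (so that the frequencies are simply 1, 2, 3, ...), the cutoff values, and the function names are all assumptions made for illustration.

```python
import math

def mode_frequency(r, mu, c, length):
    """omega_r from Equation (1.80)."""
    return math.sqrt(mu**2 + (r * math.pi * c / length)**2)

def zero_point_energy(r_max, mu, c, length):
    """Sum of omega_r / 2 over the modes r = 1 .. r_max (cutoff assumed)."""
    return sum(0.5 * mode_frequency(r, mu, c, length)
               for r in range(1, r_max + 1))

def excitation_energy(occupations, mu, c, length):
    """Energy above the vacuum for occupation numbers given as {r: n_r}."""
    return sum(n * mode_frequency(r, mu, c, length)
               for r, n in occupations.items())

mu, c, length = 0.0, 1.0, math.pi          # illustrative values (assumed)
cutoffs = (10, 100, 1000)
zero_points = [zero_point_energy(r, mu, c, length) for r in cutoffs]
delta_e = excitation_energy({1: 2, 3: 1}, mu, c, length)

for r_max, e0 in zip(cutoffs, zero_points):
    print(f"cutoff {r_max:5d}: zero-point sum = {e0:.1f}")
print(f"energy above the vacuum for n_1 = 2, n_3 = 1: {delta_e:.1f}")
```

Only the last number is physically meaningful in the conventional view: it is an energy difference, and it is finite.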


The Final Step: Lagrange-Hamilton Quantum Field Theory 


We now have just about enough experience to attempt the real problem. The
starting point will be some Lagrangian for a field $\phi(x, t)$. (In more physical
models, later, this would be perhaps a multicomponent field, say the
electromagnetic field or the electron field. The excitations of the field will be
identified with particles.)

As a Lagrangian we take

$$L = \int \left\{ \frac{1}{2}\left(\frac{\partial \phi}{\partial t}\right)^2 - \frac{1}{2}\left[\left(\frac{\partial \phi}{\partial x}\right)^2 + \mu^2 \phi^2\right] \right\} dx \qquad (1.84)$$

which is just Equation (1.68) with c = 1 and $\rho$ absorbed into the field. (This is
conventional. Notice that the dimensions of $\phi$ now vary with the number, d,
of spatial, not time, dimensions. With $\hbar = 1 = c$, the only dimension is mass, which is
proportional to $(\text{length})^{-1}$ and to $(\text{time})^{-1}$, and then the dimension
of $\phi$ is $\frac{1}{2}(d - 1)$ to ensure that the dimension of L, and H, is unity.) To be able
to quantize, we need to extend the idea of conjugate momentum so that we
can work in Hamiltonian form. Noting our treatment of the Lagrangian in
Equation (1.69), we introduce a Hamiltonian density $\mathcal{H}$, so that

$$H = \int \mathcal{H}\, dx \qquad (1.85)$$

and define a "momentum field" $\pi(x, t)$ by

$$\pi(x, t) = \frac{\partial \mathcal{L}}{\partial \dot\phi} \qquad (1.86)$$

in direct analogy to Equation (1.11). Usually this is referred to as the "momentum
canonically conjugate to $\phi$." Then by analogy with Equation (1.12),
we write

$$\mathcal{H}(\phi, \pi) = \pi(x, t)\,\dot\phi(x, t) - \mathcal{L} \qquad (1.87)$$

and see from Equation (1.84) that for our model

$$\pi(x, t) = \dot\phi(x, t) \qquad (1.88)$$

$$\mathcal{H}(x, t) = \frac{1}{2}\pi^2(x, t) + \frac{1}{2}\left(\frac{\partial \phi(x, t)}{\partial x}\right)^2 + \frac{1}{2}\mu^2 \phi^2(x, t), \qquad (1.89)$$


which in comparison with Equation (1.82) is very encouraging. 
To go to the quantized version of field theory we elevate the objects $\phi$ and
$\pi$ to operators $\hat\phi$ and $\hat\pi$. Then we have to postulate appropriate commutators
between them. (Please note that this is a new and postulated idea. We are
working in analogy with quantum mechanics, but this lack of commutation
between $\hat\phi$ and $\hat\pi$ is not a consequence of, for example, x having become a
quantum operator; on the contrary x is perfectly classical here. For this reason
the field quantization is frequently referred to as "second quantization," and
in the version we shall propose "canonical second quantization.") Now obviously
we wish to postulate commutation relations that mimic Equation (1.67) as
closely as possible in the continuous case. The generalization of the Kronecker
$\delta_{ij}$ with two indices over which summation with an arbitrary vector, f, gives
back the vector as

$$\sum_j f_j \delta_{ij} = f_i \qquad (1.90)$$

is the Dirac delta function $\delta(x - y)$ with

$$\int f(x)\,\delta(x - y)\, dx = f(y) \qquad (1.91)$$

for all reasonable functions f, as the defining property. Again, we notice that
in Equation (1.67), as previously in Equation (1.28), the commutation relations
are between the time dependent operators at equal times, for example,

$$[\hat Q_i(t), \hat P_j(t)] = i\delta_{ij} \qquad (1.92)$$

in the case of the most crucial commutators. We postulate, therefore, the
equal-time commutation relations

$$\begin{aligned}
[\hat\phi(x, t), \hat\phi(y, t)] &= 0 \\
[\hat\phi(x, t), \hat\pi(y, t)] &= i\delta(x - y) \\
[\hat\pi(x, t), \hat\pi(y, t)] &= 0
\end{aligned} \qquad (1.93)$$

with obvious generalization, through $\delta^3(\mathbf{x} - \mathbf{y})$, to three dimensions. Variables,
such as $\phi$ and $\pi$, related in this way are said to be "conjugate to each other."
We now promote Equations (1.88) and (1.89) to

$$\hat\pi(x, t) = \dot{\hat\phi}(x, t) \quad \text{and} \quad \hat{\mathcal{H}} = \frac{1}{2}\hat\pi^2 + \frac{1}{2}\left(\frac{\partial \hat\phi}{\partial x}\right)^2 + \frac{1}{2}\mu^2 \hat\phi^2 \qquad (1.94)$$

as operator equations. To find the eigenvalues and eigenstates of

$$\hat H = \int \hat{\mathcal{H}}\, dx$$

we simply make a Fourier expansion. This time we shall not restrict ourselves
to a box of length L, but let x run from $-\infty$ to $+\infty$, and use running waves
instead of standing waves. Now we expand as

$$\phi(x, t) = \int \frac{dk}{2\pi}\frac{1}{2\omega}\left[a(k)e^{ikx - i\omega t} + a^*(k)e^{-ikx + i\omega t}\right] \qquad (1.95)$$

where $\omega^2 = \mu^2 + k^2$ and we have required $\phi$ to be real, so that

$$\pi(x, t) = \int \frac{dk}{2\pi}\left(-\frac{i}{2}\right)\left[a(k)e^{ikx - i\omega t} - a^*(k)e^{-ikx + i\omega t}\right] \qquad (1.96)$$

follows at once. (Do not worry about the conventional factors of $2\pi$ and $2\omega$.
It turns out to be very convenient in the relativistic interpretation. Just think
of them as factors, which allow a smooth comparison with wave function
normalization.) When we quantize, this becomes promoted to

$$\hat\phi(x, t) = \int \frac{dk}{2\pi}\frac{1}{2\omega}\left[\hat a(k)e^{ikx - i\omega t} + \hat a^\dagger(k)e^{-ikx + i\omega t}\right] \qquad (1.97)$$

with corresponding expression for $\hat\pi$, and the crucial point is that the Fourier
coefficients have become operators. It is straightforward to see that the
commutation relations (Equation (1.93)) determine the commutators

$$\begin{aligned}
[\hat a(k), \hat a(k')] &= 0 \\
[\hat a(k), \hat a^\dagger(k')] &= 2\pi\, 2\omega\, \delta(k - k') \\
[\hat a^\dagger(k), \hat a^\dagger(k')] &= 0
\end{aligned} \qquad (1.98)$$

for the mode operators.

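You can see that $\hat a^\dagger$ and $\hat a$ are almost certainly creation and annihilation
operators for this continuum case. We need only substitute our expansions
for $\hat\phi$ and $\hat\pi$ into Equation (1.94) to see that

$$\hat H = \int \frac{dk}{2\pi}\frac{1}{2\omega}\,\frac{\omega}{2}\left[\hat a^\dagger(k)\hat a(k) + \hat a(k)\hat a^\dagger(k)\right] \qquad (1.99)$$

to confirm our hopes. If we rewrite this as

$$\hat H = \int \frac{dk}{2\pi}\frac{1}{2\omega}\,\omega\,\hat a^\dagger(k)\hat a(k) + \int \frac{dk}{2\pi}\frac{1}{2\omega}\,\frac{\omega}{2}\,[\hat a(k), \hat a^\dagger(k)] \qquad (1.100)$$

then using Equation (1.98) we recognize the second term as the infinite sum
of zero point energies, which we have learned to discard. Thus we can take

$$\hat H = \int \frac{dk}{2\pi}\frac{1}{2\omega}\,\omega\,\hat a^\dagger(k)\hat a(k) \qquad (1.101)$$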
and the spectrum will obviously follow along the usual simple harmonic
oscillator lines. There will be a ground state, or vacuum, $|0\rangle$, determined by

$$\hat a(k)|0\rangle = 0 \quad \text{for all } k \qquad (1.102)$$

into which $\hat a^\dagger(k)$ will create a quantum of frequency $\omega$ in the now familiar
way.

Notice the interpretation of the ground state as a vacuum with no particles 
in it. This is central to the interpretation of modern quantum field theory in 
elementary particle physics. 

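To get a little more feel for this new structure, consider the amplitude of $\hat{\phi}$
between the vacuum and the one particle state of momentum p:

$$\langle 0|\hat{\phi}(x,t)|p\rangle = \langle 0|\hat{\phi}(x,t)\hat{a}^\dagger(p)|0\rangle = \langle 0|\int \frac{dk}{2\pi}\,\frac{1}{2\omega}\left[\hat{a}(k)e^{ikx-i\omega t} + \hat{a}^\dagger(k)e^{-ikx+i\omega t}\right]\hat{a}^\dagger(p)|0\rangle. \tag{1.103}$$

From the conjugate of Equation (1.102), $\langle 0|\hat{a}^\dagger(k) = 0$, we see that the second term gives zero,
and with this trick in mind we rewrite the first term as

$$\langle 0|\hat{\phi}(x,t)|p\rangle = \langle 0|\int \frac{dk}{2\pi}\,\frac{1}{2\omega}\,e^{ikx-i\omega t}\left[\hat{a}(k), \hat{a}^\dagger(p)\right]|0\rangle \tag{1.104}$$

to enable us to use Equation (1.98). Thus we find

$$\langle 0|\hat{\phi}(x,t)|p\rangle = \langle 0|\int \frac{dk}{2\pi}\,\frac{1}{2\omega}\,e^{ikx-i\omega t}\,2\pi\,2\omega\,\delta(k-p)|0\rangle \tag{1.105}$$
$$= e^{ipx-i\omega t} \tag{1.106}$$

where $\omega^2 = \mu^2 + p^2$ now, and $\langle 0|0\rangle = 1$ has been assumed. You will probably
recognize this as the wave function for this problem. (We actually looked at
standing waves earlier, which are superpositions of these. But the $\mu^2 = 0$ case
should be very familiar.) So this is where the old wave functions of quantum
mechanics appear; they are vacuum to one-particle matrix elements of the
field operators.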

Finally, think about the two-particle state

$$|k_1, k_2\rangle = \hat{a}^\dagger(k_1)\hat{a}^\dagger(k_2)|0\rangle. \tag{1.107}$$

Because of Equation (1.98), the two creation operators commute, so we see that this is symmetric under the $k_1 \leftrightarrow k_2$
interchange. There is no way to distinguish one quantum of energy (particle)
from another, so we must be dealing with bosons. Obviously, something will
have to be modified later to handle fermions, then the spin, and then interactions
between particles.




Group Theory for the Standard Model of Particle Physics and Beyond 


References 


1. E. Noether, Nachr. Akad. Wiss. Göttingen Math.-Phys. Kl. II (1918): 235; M.A. Tavel,
   Transport Theory Statist. Phys. 1, no. 3 (1971): 183.

2. R.P. Feynman, The Feynman Lectures on Physics, Vol. 2. Addison-Wesley, Reading,
   MA, 1964, chapter 19.

3. H. Goldstein. Euler-Lagrange equation. Classical Mechanics, 2nd ed. Addison-
   Wesley, Reading, MA, 1980, p. 44.

4. J. Poisson, Poisson Bracket, J. de l'École Polytech. 8 (1809): 266; Whittaker, 1944,
   p. 299.

5. W. Heisenberg, Heisenberg: The Uncertainty Relations. Zeitschrift für Physik 43
   (1927): 172.

6. G. Sterman. Heisenberg Picture, Appendix A. An Introduction to Quantum Field
   Theory. Cambridge University Press, 1993.

7. G. Sterman. Schrödinger Picture, Appendix A. An Introduction to Quantum Field
   Theory. Cambridge University Press, 1993.

8. G. Arfken. Fourier Series. In Mathematical Methods for Physicists, 3rd ed., 1985,
   chapter 14.

9. N. Higham. Kronecker delta. Handbook of Writing for the Mathematical Sciences,
   Society for Industrial and Applied Mathematics, 1998.

10. P.A.M. Dirac. Quantum Mechanics, 4th ed. Oxford University Press, London.


Problems 


1.1 Solve $\ddot{x} = -\omega^2 x$ to find $x = x_{max}\cos(\omega t + \delta)$ where $\delta$ and $x_{max}$ are
constants.

1.2 Write down the Lagrangian for a body of mass m experiencing the
acceleration g due to gravity. Hence, solve the problem of a body
dropped from rest. (Yes, it really is trivial.)

1.3 For the harmonic oscillator, evaluate S for the three paths (a) $q = \sin\omega t$
(this is the Newtonian path, it should give minimum S); (b) $q = at$;
and (c) $q = bt^2$, adjusting a and b to ensure the same endpoint
for all three paths.

1.4 Write out dH to check that the Legendre transformation really does
yield Hamilton's canonical equations.

1.5 Show that if the kinetic term has the form

$$T = \frac{1}{2}\sum_{ij} m_{ij}\,\dot{q}_i\dot{q}_j$$

where i and j label the N particles of a system, and the $m_{ij}$ are
generalized "masses" independent of the velocities $\dot{q}_i$, then H =
T + V whenever the potential is velocity independent. (These circumstances
do arise whenever the constraint equations [from the
variables you first use to the $q_i$] are independent of time. Try
writing the kinetic term for a single particle in two dimensions first
in Cartesian and then in polar coordinates.)

1.6 Check that Hamilton's equations for Problem 1.2 do yield the same
second order equation on substitution. Solve the first order equations
directly, and confirm the previous result.

1.7 Start from $A = \alpha x + \beta p$ where $\alpha$ and $\beta$ are constants and "design"
the form in Equation 1.19 for yourself.

1.8 Show that the two solutions are equivalent and find the relationships
between the constants of the two solutions.

1.9 Check that the Poisson bracket equations of motion give the usual
results for the particle falling under gravity.

1.10 Check that matrix elements of $\hat{F}(t)$ between Heisenberg states agree
with those of $\hat{F}$ between Schrödinger states. (Yes, these questions
are trivial.)

1.11 Derive Equation (1.33) from Equation (1.28).

1.12 Derive Equations (1.36) and (1.37) from Equation (1.29).

1.13 If you are not sure about the meaning of Hermitian operators (like
$\hat{x}$ and $\hat{p}$) please look up the idea. As a check, show that $\hat{p} \to -i\,\partial/\partial x$
(in the Schrödinger representation) is Hermitian.

1.14 The operator solution of the harmonic oscillator is of central importance.
To make sure that you have got the idea up to this point, close
your notes and work it again starting with $H = \hat{x}^2 + \hat{p}^2$ so that the
constants are different.

1.15 Go on - do it!

1.16 Take the expectation value of $H = \hat{x}^2 + \hat{p}^2$ for an arbitrary state,
and use the hermiticity of $\hat{x}$ and $\hat{p}$ to show that you have a sum of
moduli squared, hence, not negative.

1.17 Assume $C_n|E_{n+1}\rangle = \hat{a}^\dagger|E_n\rangle$, where $\langle E_n|E_n\rangle = 1$. Now use
$\langle E_{n+1}|E_{n+1}\rangle = 1$ to find $C_n$, taken to be real. Now show that Equation (1.51)
is correct. Find the wave function for the first excited state, explicitly.
Hint: First use Equation (1.49) to find the ground state wave
function.

1.18 Derive Equations (1.57) and (1.58).

1.19 This is a question you can ignore if you like. Try three equal masses
in a line joined only by springs (of constant g) between the middle
one and each of the end ones and otherwise free. You should get
three equations of motion. Try solutions in which all three masses
have a single frequency (normal mode, of course), to get three algebraic
equations. Find the three values of the frequency that make
these (homogeneous) equations compatible. (You can set the determinant
to zero. Alternatively, pick out the zero frequency mode
and the problem looks easy enough to guess the configurations of
the other modes.)

1.20 Please work out Equation (1.66) for yourself.

1.21 Please check Equation (1.67). Start by the usual commutators for
the original variables and remember that $[\hat{x}_1, \hat{p}_2] = 0$ and so forth.

1.22 What will the wave function look like? Write the wave function
explicitly for $n_1 = 0 = n_2$.

1.23 If this is not clear to you, try Problem 1.19 then think about Problem
1.22 if there are three masses.

1.24 Go on - do it!

1.25 If you feel weak, just verify that Equation (1.77) solves Equation
(1.76). If you feel strong try to prove that this is the general solution.
(Change variables to $x \pm ct$.)

1.26 Try superimposing two sine waves each of wavelength $\lambda$ and frequency
$\nu$ but traveling in opposite directions. If $\nu_0$ is the lowest
frequency mode on a pipe of length L open at one end and closed
at the other, what is the speed of sound?

1.27 Go on - do it! [$\cos 2A - \cos 2B = 2\sin(A+B)\sin(B-A)$.]

1.28 Check this please.

1.29 Check again please.

1.30 Check that the dimensions work out for $[\hat{\phi}(x,t), \hat{\pi}(y,t)] = i\hbar\,\delta^d(x - y)$
in d space dimensions.

1.31 Actually it is not quite so straightforward: you need to be able to
invert the Fourier transform. But you can easily verify that Equations
(1.98) and (1.99) do give Equation (1.93), so please do this.

1.32 Go on - it gets a bit messy - but you can do it!

1.33 Show this, please. By now you really should be able to solve the
harmonic oscillator by the operator method!
Index Notation


Index notation is the modern and easy way to work through problems such
as we now face. It has always seemed to the author that many good, even
great, physicists have never learned this topic and therefore find sections of
books and papers that do use it very hard or even impossible to follow. It
is truly easy to learn and takes only a little practice to become competent in
its use. Parts of this section will appear again later, sometimes as problems,
so the equations here will be numbered I1, I2, and so forth to emphasize the
point.

Indices in this section are lowercase letters that can be attached as subscripts
or superscripts to appropriate things, such as momentum or angular
momentum. An example might be $J_i$ where J is a Hermitian ($J = J^\dagger$) angular
momentum operator and i is a subscript in the range (1, 2, 3) or (x, y, z)
specifying which component is being treated. Such an index, appearing
once and once only, is called a free index. It is free for you to pick within
the range specified. Such indices must balance in equations. For example,

$$A_i = 7B_i + 12C_i \tag{I1}$$

would be acceptable whereas $A_i = 7B$ or $A_i = 7B_i + 12C$ is not. An index
repeated once and only once in an expression is called a dummy index and
implies summation. This is known as the Einstein convention. For example,

$$A_iA_i = A_1A_1 + A_2A_2 + A_3A_3 \tag{I2}$$


where you must take great care not to confuse the power, A squared, that
is, $(A)^2$, with the second component $A^2$ when the index is written as a superscript. At this stage indices can appear
either as subscripts or superscripts. The safe way to write things is to reserve
numbers for powers (or use brackets). There is a mathematical theorem to the
effect that there are two numerically invariant (i.e., not changing under, say,
a rotation even if $A_i$ does) tensors. The first is the Kronecker delta $\delta_{ij}$, which
is symmetric in the two indices and has the values

$$\delta_{ij} = 0 \quad\text{if } i \neq j \tag{I3}$$
$$\delta_{ij} = 1 \quad\text{if } i = j. \tag{I4}$$

This has three important implications. First,

$$\delta_{ij}A_j = A_i, \tag{I5}$$

which can be seen by writing out the sum as

$$\delta_{i1}A_1 + \delta_{i2}A_2 + \delta_{i3}A_3, \tag{I6}$$

which then yields, for example,

$$1 \times A_1 + 0 \times A_2 + 0 \times A_3 = A_1 \tag{I7}$$

if i = 1 and so forth.
Second,

$$\delta_{ii} = 3, \tag{I8}$$

which can be confirmed by writing out the sum as

$$\delta_{11} + \delta_{22} + \delta_{33} = 1 + 1 + 1 = 3 \tag{I9}$$

in the three dimensions in which we are working.
Third,

$$\delta_{ij}\delta_{jm} = \delta_{im}, \tag{I10}$$

which can be seen by expanding the sum to read

$$\delta_{i1}\delta_{1m} + \delta_{i2}\delta_{2m} + \delta_{i3}\delta_{3m}. \tag{I11}$$

The second numerically invariant tensor is the Levi-Civita tensor $\varepsilon_{ijk}$, which
is totally antisymmetric in the indices with $\varepsilon_{123} = 1$ by convention. Obviously
$\varepsilon_{123} = 1$ tells us that $\varepsilon_{231} = 1$ and so forth but also $\varepsilon_{132} = -1$ and so forth
and $\varepsilon_{112} = 0$.
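The rules for free and dummy indices can be checked mechanically. The following is a minimal sketch in plain Python, not part of the original text: the helper names `delta` and `eps` and the sample components in `A` are illustrative assumptions, and the indices 0, 1, 2 stand for the book's 1, 2, 3.

```python
from itertools import product

R = range(3)  # indices 0, 1, 2 stand for the book's 1, 2, 3

def delta(i, j):
    """Kronecker delta."""
    return 1 if i == j else 0

def eps(i, j, k):
    """Levi-Civita symbol with eps(0, 1, 2) = +1 (closed form valid in 3 dimensions)."""
    return (i - j) * (j - k) * (k - i) // 2

A = [2.0, -1.0, 5.0]  # arbitrary illustrative components

# delta_ij A_j = A_i: j is a dummy index (summed), i is free
contracted = [sum(delta(i, j) * A[j] for j in R) for i in R]

# delta_ii = 3 in three dimensions
trace = sum(delta(i, i) for i in R)

# delta_ij delta_jm = delta_im for every choice of the free indices i and m
chain_ok = all(
    sum(delta(i, j) * delta(j, m) for j in R) == delta(i, m)
    for i, m in product(R, R)
)
```

Each sum over a repeated index is written out explicitly here, which is exactly what the Einstein convention abbreviates.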
Quantum Angular Momentum


We define an angular momentum by a set of three operators, $J_i$, i = 1, 2, 3 or
x, y, z, which satisfy

$$[J_i, J_j] = i\varepsilon_{ijk}J_k \tag{2.1}$$
$$J_i^\dagger = J_i \tag{2.2}$$

where $\hbar = 1$ in our natural units, and the Einstein summation convention has
been used. This simply means that an index repeated once is summed over,
as distinct from one appearing once, which is a free index for you to pick. The
same index appearing three or more times is an error.

Here $\varepsilon_{ijk}$ is the Levi-Civita tensor (density), which is antisymmetric in any
pair of indices with $\varepsilon_{123} = 1$.

Now the existence of this (Lie) algebra is very far from trivial and the
content is very high indeed. We will look at the latter aspect first. Switch from
the Cartesian basis to a spherical one by defining

$$J_\pm = J_1 \pm iJ_2 \tag{2.3}$$

so that

$$[J_3, J_\pm] = \pm J_\pm \tag{2.4}$$
$$[J_+, J_-] = 2J_3 \tag{2.5}$$

with

$$J_+^\dagger = J_- \tag{2.6}$$
$$J_-^\dagger = J_+ \tag{2.7}$$
$$J_3^\dagger = J_3. \tag{2.8}$$

Then consider the operator

$$J^2 = J_iJ_i \tag{2.9}$$
$$= J_-J_+ + J_3(J_3 + 1) \tag{2.10}$$
$$= J_+J_- + J_3(J_3 - 1). \tag{2.11}$$

Notice that

$$[J_i, J^2] = [J_i, J_jJ_j] \tag{2.12}$$
$$= J_j[J_i, J_j] + [J_i, J_j]J_j \tag{2.13}$$

(You should write out [A, BC] explicitly to understand this point.)

$$= i\varepsilon_{ijk}(J_jJ_k + J_kJ_j) = 0 \quad\text{by symmetry.} \tag{2.14}$$
$$[J_i, J^2] = 0. \tag{2.15}$$


Such an operator is called a Casimir. You can always use Casimirs as part
of your complete commuting set of observables. We will take $J^2$ and $J_3$
(you will learn later that there can be no more) and set up the eigenvalue
problem as

$$J^2|\beta, m\rangle = \beta|\beta, m\rangle \tag{2.16}$$
$$J_3|\beta, m\rangle = m|\beta, m\rangle. \tag{2.17}$$

Then returning to our theme of the angular momentum we see that

$$J_3J_+|\beta, m\rangle = (J_+J_3 + J_+)|\beta, m\rangle \tag{2.18}$$
$$= (m + 1)J_+|\beta, m\rangle \tag{2.19}$$

also

$$J^2J_+|\beta, m\rangle = J_+J^2|\beta, m\rangle \tag{2.20}$$
$$= \beta J_+|\beta, m\rangle \tag{2.21}$$

so that

$$J_+|\beta, m\rangle = c^+(\beta, m)|\beta, m+1\rangle \tag{2.22}$$

unless $c^+(\beta, m)$ vanishes for $m_{max} = j$, say. Now use the normalization to see
that

$$c^+(\beta, m)^*c^+(\beta, m)\langle\beta, m+1|\beta, m+1\rangle = \langle\beta, m|J_-J_+|\beta, m\rangle \tag{2.23}$$
$$= \langle\beta, m|[J^2 - J_3(J_3 + 1)]|\beta, m\rangle \tag{2.24}$$
$$= \beta - m(m+1) \tag{2.25}$$

therefore

$$|c^+(\beta, m)|^2 = \beta - m(m+1). \tag{2.26}$$

Similarly

$$J_-|\beta, m\rangle = c^-(\beta, m)|\beta, m-1\rangle \tag{2.27}$$

with $|c^-(\beta, m)|^2 = \beta - m(m-1)$. Of course, $\beta$ has to be positive because $J^2$ is
a sum $J_iJ_i^\dagger$ (therefore like the sum of norms of vectors). More formally

$$\beta = \langle\beta, m|J^2|\beta, m\rangle \tag{2.28}$$
$$= \sum_{i,n}\langle\beta, m|J_i|n\rangle\langle n|J_i|\beta, m\rangle \tag{2.29}$$
$$= \sum_{i,n}|\langle n|J_i|\beta, m\rangle|^2 \geq 0. \tag{2.30}$$

Clearly then, m cannot get too big for fixed $\beta$, and $c^+(\beta, j) = 0$ tells us

$$\beta = j(j+1). \tag{2.31}$$

Neither can m become too negative, and

$$j(j+1) = m_{min}(m_{min} - 1) \tag{2.32}$$

tells us $m_{min} = -j$ (or j + 1, which is crazy).
Finally, the step length is an integer, so $m_{max} - m_{min} = 2j$ is a non-negative integer and we see that j is an integer or half-integer.


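Result

$$J^2|j, m\rangle = j(j+1)|j, m\rangle \tag{2.33}$$
$$J_3|j, m\rangle = m|j, m\rangle \tag{2.34}$$

$-j \leq m \leq j$ with each taking integer (or half-integer) values. (2.35)

$$\langle j'm'|J_3|jm\rangle = m\,\delta_{j'j}\delta_{m'm} \quad\text{where } \delta_{ij} \text{ is the Kronecker delta} \tag{2.36}$$
$$\langle j'm'|J^2|jm\rangle = j(j+1)\,\delta_{j'j}\delta_{m'm} \tag{2.37}$$

with

$$\delta_{11} = \delta_{22} = \delta_{33} = 1 \tag{2.38}$$

and

$$\delta_{ij} = 0 \quad\text{if } i \neq j \tag{2.39}$$
$$\langle j'm'|J_\pm|jm\rangle = \sqrt{j(j+1) - m(m \pm 1)}\;\delta_{j'j}\delta_{m',m\pm1}. \tag{2.40}$$

The square root coefficient is real and positive by phase convention.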
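The matrix elements in the Result can be turned directly into explicit matrices for any j. The following is a minimal NumPy sketch, not taken from the text: the function name `angular_momentum_matrices`, the helper `comm`, and the ordering of the basis (m = j first) are assumptions made for illustration.

```python
import numpy as np

def angular_momentum_matrices(j):
    """Return (J1, J2, J3) for spin j in the basis m = j, j-1, ..., -j."""
    dim = int(round(2 * j)) + 1
    m = j - np.arange(dim)                  # m values down the diagonal
    J3 = np.diag(m).astype(complex)
    Jp = np.zeros((dim, dim), dtype=complex)
    for col in range(1, dim):
        # <j, m+1| J+ |j, m> = sqrt(j(j+1) - m(m+1)); |j, m+1> sits one row up
        Jp[col - 1, col] = np.sqrt(j * (j + 1) - m[col] * (m[col] + 1))
    Jm = Jp.conj().T                        # J- is the Hermitian conjugate of J+
    J1 = (Jp + Jm) / 2
    J2 = (Jp - Jm) / (2 * 1j)
    return J1, J2, J3

def comm(a, b):
    """Commutator [a, b]."""
    return a @ b - b @ a

# Illustrative choice: j = 3/2, a four-dimensional representation
J1, J2, J3 = angular_momentum_matrices(1.5)
Jsq = J1 @ J1 + J2 @ J2 + J3 @ J3
```

For j = 3/2 the Casimir `Jsq` should come out proportional to the unit matrix with eigenvalue j(j + 1) = 15/4, and the three matrices should close under the commutator as in Equation (2.1).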
Matrix Representations


Any problem with precisely N eigenvalues and eigenvectors (all nondegenerate)
may be represented on the space of

$$A_N|n\rangle = n|n\rangle \qquad n = 1, \ldots, N, \tag{2.41}$$

which has the matrix representation

$$\langle r|A_N|c\rangle \Rightarrow \begin{pmatrix} 1 & & & 0 \\ & 2 & & \\ & & \ddots & \\ 0 & & & N \end{pmatrix} \tag{2.42}$$

and

$$|r\rangle \Rightarrow \begin{pmatrix} 0 \\ \vdots \\ 1 \\ \vdots \\ 0 \end{pmatrix} \tag{2.43}$$

where the 1 is in the rth position.
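Spin $\tfrac{1}{2}$

$$J^2|\tfrac12, \pm\tfrac12\rangle = \tfrac12\left(\tfrac12 + 1\right)|\tfrac12, \pm\tfrac12\rangle \tag{2.44}$$
$$J_3|\tfrac12, \pm\tfrac12\rangle = \pm\tfrac12|\tfrac12, \pm\tfrac12\rangle. \tag{2.45}$$

This may be represented on the $A_N$ space, with labels taken as $\pm\tfrac12$ for convenience,
that is,

$$|{+\tfrac12}\rangle \Rightarrow \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \qquad |{-\tfrac12}\rangle \Rightarrow \begin{pmatrix} 0 \\ 1 \end{pmatrix}. \tag{2.46}$$

We call

$$J_i = \tfrac12\sigma_i \tag{2.47}$$

and look at the matrix representations.

$$\langle m|\sigma_3|n\rangle = 2m\,\delta_{mn} \Rightarrow \sigma_3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \tag{2.48}$$
$$\langle m|\sigma_+|n\rangle = 2\sqrt{\tfrac34 - n(n+1)}\;\delta_{m,n+1} \Rightarrow \sigma_+ = \begin{pmatrix} 0 & 2 \\ 0 & 0 \end{pmatrix} \tag{2.49}$$
$$\langle m|\sigma_-|n\rangle = 2\sqrt{\tfrac34 - n(n-1)}\;\delta_{m,n-1} \Rightarrow \sigma_- = \begin{pmatrix} 0 & 0 \\ 2 & 0 \end{pmatrix} \tag{2.50}$$

Of course,

$$\sigma_1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma_2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix} \tag{2.51}$$
$$\sigma_i\sigma_j = \delta_{ij}1 + i\varepsilon_{ijk}\sigma_k \tag{2.52}$$
$$\mathrm{Tr}\,\sigma_i = 0, \qquad \mathrm{Det}\,\sigma_i = -1 \tag{2.53}$$
$$(\sigma_i)^a{}_b(\sigma_i)^c{}_d = 2\delta^a_d\delta^c_b - \delta^a_b\delta^c_d. \tag{2.54}$$

The three $\sigma_i$ (often called $\tau_i$) are called Pauli matrices.

This last property is called completeness. It is the usual matrix completeness
here or completeness in the Dirac sense. A proof of this might be obtained by
expanding an arbitrary matrix M as

$$M = \frac{1}{\sqrt2}M^i\sigma_i + \frac{1}{\sqrt2}M^0 1 \tag{2.55}$$

where clearly

$$M^i = \frac{1}{\sqrt2}\mathrm{Tr}(M\sigma^i) \tag{2.56}$$
$$M^0 = \frac{1}{\sqrt2}\mathrm{Tr}(M1). \tag{2.57}$$

Then

$$M = \tfrac12\sigma_i\mathrm{Tr}(M\sigma_i) + \tfrac12 1\,\mathrm{Tr}(M) \tag{2.58}$$

so that

$$2M^a{}_b = (\sigma_i)^a{}_b\,M^d{}_c\,(\sigma_i)^c{}_d + \delta^a_b\delta^c_d\,M^d{}_c. \tag{2.59}$$

Hence we get

$$M^d{}_c\left[(\sigma_i)^a{}_b(\sigma_i)^c{}_d + \delta^a_b\delta^c_d - 2\delta^a_d\delta^c_b\right] = 0 \tag{2.60}$$

but M was quite arbitrary.
Note that a general state of spin $\tfrac12$ is now written as

$$|\psi\rangle = a_+|\tfrac12, \tfrac12\rangle + a_-|\tfrac12, -\tfrac12\rangle \tag{2.61}$$

with

$$|a_+|^2 + |a_-|^2 = 1 \tag{2.62}$$

and the probability of "spin-up" being given by

$$|\langle\tfrac12, \tfrac12|\psi\rangle|^2 = |a_+|^2. \tag{2.63}$$

Now deduce

$$\varepsilon_{ijk}\varepsilon^{imn} = \delta_j^m\delta_k^n - \delta_j^n\delta_k^m \tag{2.64}$$

and

$$\varepsilon_{ijk}\varepsilon^{ijn} = 2!\,\delta_k^n \tag{2.65}$$

and

$$\varepsilon_{ijk}\varepsilon^{ijk} = 3!. \tag{2.66}$$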
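The completeness property of the Pauli matrices and the trace expansion of an arbitrary 2 x 2 matrix are easy to confirm numerically. The sketch below uses NumPy and assumes the completeness relation in the standard form, sum over i of (sigma_i)_ab (sigma_i)_cd = 2 delta_ad delta_cb - delta_ab delta_cd; the array names and the sample matrix `M` are illustrative choices, not from the text.

```python
import numpy as np

# The three Pauli matrices stacked along the first axis
sigma = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)
one = np.eye(2)

# sum_i (sigma_i)_ab (sigma_i)_cd as a four-index array [a, b, c, d]
lhs = np.einsum("iab,icd->abcd", sigma, sigma)
# 2 delta_ad delta_cb - delta_ab delta_cd
rhs = 2 * np.einsum("ad,cb->abcd", one, one) - np.einsum("ab,cd->abcd", one, one)

# Expansion of an arbitrary matrix: M = (1/2) sigma_i Tr(M sigma_i) + (1/2) 1 Tr(M)
M = np.array([[1 + 2j, 3], [-1j, 4]])
M_rebuilt = 0.5 * sum(s * np.trace(M @ s) for s in sigma) + 0.5 * one * np.trace(M)
```

The `einsum` strings spell out the index bookkeeping: a repeated letter on the input side that is absent from the output is summed, exactly as a dummy index is.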


Addition of Angular Momenta 


Suppose we have two quite independent systems (well separated angular
momenta, or distinct elementary particles carrying isospin, etc.) so that the
algebra is

$$[j_i^A, j_j^A] = i\varepsilon_{ijk}j_k^A \tag{2.67}$$
$$[j_i^B, j_j^B] = i\varepsilon_{ijk}j_k^B \tag{2.68}$$
$$[j_i^A, j_j^B] = 0 \tag{2.69}$$

where A and B label the systems. We can take our complete commuting set
of observables as $\{j_3^A, (j^A)^2, j_3^B, (j^B)^2\}$; then the states have the outer product
form.

FIGURE 2.1
The distribution of angular momentum. (Axis: M marked at $-(j_A + j_B)$, $-j_A$, $-j_B$, $j_B$, $j_A$, and $(j_A + j_B)$.)

It will frequently be convenient to describe the system by an alternative set,
which includes the total angular momentum

$$J_i = j_i^A + j_i^B. \tag{2.70}$$

Moreover, $(j^A)^2$ and $(j^B)^2$ are Casimirs. So an alternative complete commuting
set of observables is $\{J^2, J_3, (j^A)^2, (j^B)^2\}$ and we label the states $|J, M, j_A, j_B\rangle$,
frequently written $|JM\rangle$ when the Casimir labels are dropped.

Now we need to know how the eigenvalues and their ranges are related, and
also how to find the coefficients (Clebsch-Gordan) in the linear relationship
between $|JM\rangle$ and $|j_Am_A\rangle|j_Bm_B\rangle$. Well

$$J_3 = j_3^A + j_3^B \tag{2.71}$$

so you have

$$M = m_A + m_B \tag{2.72}$$

where we take $j_A \geq j_B$.
Again, each J will carry the usual (2J + 1) values of M so the number of times
you get M is

$$n(M) = \sum_{J \geq |M|} N(J) \tag{2.73}$$

where N(J) is the number of times you get J. Hence,

$$N(J) = n(M = J) - n(M = J + 1). \tag{2.74}$$

Now we can find n(M). You get M by $m_A + m_B$ subject to the ranges. Take
$j_B \leq j_A$ and draw a picture and work your way in from the extreme right.
Clearly you get nothing until you get down to $j_A + j_B$ (see Figure 2.1). Thus,
n(M) = 0 if $|M| > j_A + j_B$.
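The counting argument can be carried out by brute force. The following plain-Python sketch is an illustration only (the function names `m_values` and `multiplicities` are assumptions): it tabulates n(M) from all pairs of m values and then applies N(J) = n(M = J) - n(M = J + 1).

```python
from collections import Counter
from fractions import Fraction

def m_values(j):
    """All m from -j to +j in unit steps, as exact fractions."""
    j = Fraction(j)
    return [-j + k for k in range(int(2 * j) + 1)]

def multiplicities(jA, jB):
    """Return {J: N(J)} using N(J) = n(M=J) - n(M=J+1)."""
    # n[M] counts how many (mA, mB) pairs give M = mA + mB
    n = Counter(mA + mB for mA in m_values(jA) for mB in m_values(jB))
    result = {}
    J = Fraction(jA) + Fraction(jB)     # start from the largest possible M
    while J >= 0:
        count = n[J] - n[J + 1]         # a missing key counts as zero
        if count:
            result[J] = count
        J -= 1
    return result

# Example: 1 (x) 1/2 should contain J = 3/2 and J = 1/2 once each
example = multiplicities(1, Fraction(1, 2))
```

Using `Fraction` keeps half-integer values exact, so the dictionary keys can be compared without rounding worries.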
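Clebsch-Gordan Coefficients


To find the precise coefficients it is most instructive simply to work an example
that shows all the features.

$$1 \otimes \tfrac12 = \tfrac32 \oplus \tfrac12 \tag{2.75}$$

We must relate $|1, m_A\rangle|\tfrac12, m_B\rangle$ (with $m_A = +1, 0, -1$ and $m_B = \pm\tfrac12$) to $|\tfrac32, M\rangle$ (with $M = \pm\tfrac32, \pm\tfrac12$) and $|\tfrac12, M\rangle$ (with $M = \pm\tfrac12$).
Notice for speed that

$$\sqrt{j(j+1) - m(m-1)} = \sqrt{(j+m)(j+1-m)} \tag{2.76}$$

starts as $\sqrt{(2j)(1)}$ when m = j, then $\sqrt{(2j-1)(2)}$ and so on.
We have

$$|\tfrac32, \tfrac32\rangle = |1, 1\rangle|\tfrac12, \tfrac12\rangle \tag{2.77}$$

and can repeatedly lower to get

$$|\tfrac32, \tfrac12\rangle = \sqrt{\tfrac13}\,|1, 1\rangle|\tfrac12, -\tfrac12\rangle + \sqrt{\tfrac23}\,|1, 0\rangle|\tfrac12, \tfrac12\rangle \tag{2.78}$$

$$|\tfrac32, -\tfrac12\rangle = \sqrt{\tfrac23}\,|1, 0\rangle|\tfrac12, -\tfrac12\rangle + \sqrt{\tfrac13}\,|1, -1\rangle|\tfrac12, \tfrac12\rangle$$

and

$$|\tfrac32, -\tfrac32\rangle = |1, -1\rangle|\tfrac12, -\tfrac12\rangle. \tag{2.79}$$

Now, we need $|\tfrac12, +\tfrac12\rangle$. Well $M = m_A + m_B$ tells us

$$|\tfrac12, \tfrac12\rangle = a\,|1, 1\rangle|\tfrac12, -\tfrac12\rangle + b\,|1, 0\rangle|\tfrac12, \tfrac12\rangle. \tag{2.80}$$

Then:
1. Normalization $\Rightarrow |a|^2 + |b|^2 = 1$.
2. Orthogonality to $\langle\tfrac32, \tfrac12|$ $\Rightarrow a = -\sqrt2\,b$. (You could get this also by
$J_+|\tfrac12, \tfrac12\rangle = 0$.)
3. Condon and Shortley phase convention (theory of atomic spectra):
$\{\langle j_Aj_A|\langle j_B, J - j_A|\}|JJ\rangle$ is real and > 0 ($j_A \geq j_B$), which implies a > 0.

The coefficients relating the two sets of descriptions are called Clebsch-
Gordan coefficients [1].
Hence

$$|\tfrac12, \tfrac12\rangle = \sqrt{\tfrac23}\,|1, 1\rangle|\tfrac12, -\tfrac12\rangle - \sqrt{\tfrac13}\,|1, 0\rangle|\tfrac12, \tfrac12\rangle \tag{2.81}$$

and we can lower to

$$|\tfrac12, -\tfrac12\rangle = \sqrt{\tfrac13}\,|1, 0\rangle|\tfrac12, -\tfrac12\rangle - \sqrt{\tfrac23}\,|1, -1\rangle|\tfrac12, \tfrac12\rangle. \tag{2.82}$$

You can now solve these equations to get

$$|1, 1\rangle|\tfrac12, -\tfrac12\rangle = \sqrt{\tfrac13}\,|\tfrac32, \tfrac12\rangle + \sqrt{\tfrac23}\,|\tfrac12, \tfrac12\rangle \quad\text{and so on.} \tag{2.83}$$

The coefficients are called Clebsch-Gordan (or vector-addition) coefficients.
You get tables of them [1].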
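The worked example can be cross-checked with explicit matrices on the six-dimensional product space. This is a minimal NumPy sketch under stated assumptions: the helper names `spin_matrices` and `ket`, and the basis ordering (largest m first, the index of system B running fastest), are illustrative and not from the text.

```python
import numpy as np

def spin_matrices(j):
    """Return (J+, J3) for spin j in the basis m = j, ..., -j."""
    dim = int(round(2 * j)) + 1
    m = j - np.arange(dim)
    Jp = np.zeros((dim, dim))
    for col in range(1, dim):
        Jp[col - 1, col] = np.sqrt(j * (j + 1) - m[col] * (m[col] + 1))
    return Jp, np.diag(m)

JpA, J3A = spin_matrices(1)
JpB, J3B = spin_matrices(0.5)
IA, IB = np.eye(3), np.eye(2)

# Total operators J = (jA x 1) + (1 x jB) on the product space
Jp = np.kron(JpA, IB) + np.kron(IA, JpB)
Jm = Jp.T                                   # J+ is real here, so J- is its transpose
J3 = np.kron(J3A, IB) + np.kron(IA, J3B)
Jsq = Jm @ Jp + J3 @ (J3 + np.eye(6))       # J^2 = J- J+ + J3 (J3 + 1)

def ket(mA, mB):
    """Product state |1, mA>|1/2, mB> as a six-component vector."""
    a = np.zeros(3)
    a[int(1 - mA)] = 1.0
    b = np.zeros(2)
    b[int(0.5 - mB)] = 1.0
    return np.kron(a, b)

stretched = ket(1, 0.5)                     # |3/2, 3/2>
lowered = Jm @ stretched / np.sqrt(3)       # |3/2, 1/2>; the lowering coefficient is sqrt(3)
doublet = np.sqrt(2 / 3) * ket(1, -0.5) - np.sqrt(1 / 3) * ket(0, 0.5)  # candidate |1/2, 1/2>
```

If the coefficients are right, `lowered` reproduces the combination with weights sqrt(1/3) and sqrt(2/3), and `doublet` is an eigenvector of `Jsq` with eigenvalue 3/4 that is orthogonal to `lowered`.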


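Notes

1. The tables can be read either way.

2. If you do $j_B + j_A$ then

$$\{\langle j_Am_A|\langle j_Bm_B|\}|JM\rangle = (-1)^{J - j_A - j_B}\{\langle j_Bm_B|\langle j_Am_A|\}|JM\rangle, \tag{2.84}$$

which is important mainly if $j_A = j_B$ and $J - j_A - j_B$ is odd.

3. Spectroscopic notation ${}^{2S+1}(S, P, D, F, G, \ldots \text{ for } L)_J$; where S, P, D, F,
and so forth stand for the spectroscopic series sharp, principal,
diffuse, fundamental, and so on.

4. When adding L to $\tfrac12$ they use lowercase letters and leave off the spin
multiplicity superscript. For example,

$j = \tfrac12$: $s_{1/2}$ and $p_{1/2}$
$j = \tfrac32$: $p_{3/2}$ and $d_{3/2}$

5. General formalism follows from completeness:

$$|JM\rangle = \Big(\sum_{m_Am_B}|m_Am_B\rangle\langle m_Am_B|\Big)|JM\rangle = \sum_{m_Am_B}\langle m_Am_B|JM\rangle\,|m_Am_B\rangle \tag{2.85}$$

where $\langle m_Am_B|JM\rangle$ is the Clebsch-Gordan coefficient, real by our conventions. Similarly,

$$|m_Am_B\rangle = \Big(\sum_{JM}|JM\rangle\langle JM|\Big)|m_Am_B\rangle = \sum_{JM}\langle JM|m_Am_B\rangle\,|JM\rangle \tag{2.86}$$

which is why the tables read both ways.

6. Notation. The coefficient $\langle j_Aj_Bm_Am_B|j_Aj_BJM\rangle$ is also written

$$C_{j_Aj_B}(J, M; m_Am_B) \quad\text{or}\quad C^{j_Aj_B}_{J\,m_Am_B}.$$

Wigner "3-j" symbol

$$\begin{pmatrix} j_A & j_B & J \\ m_A & m_B & -M \end{pmatrix} = \frac{(-1)^{j_A - j_B + M}}{\sqrt{2J+1}}\,\langle j_Aj_Bm_Am_B|j_Aj_BJM\rangle.$$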

Matrix Representation of Direct (Outer, Kronecker) Products 


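We have worked with states $|m_A\rangle|m_B\rangle$, and used $J_i = (j_i^A \times 1) + (1 \times j_i^B)$
in a formal way. To represent this on matrices we need the direct product
construction.

If $\phi$ and $\theta$ are (m x 1) and (n x 1) matrices (columns) you can form the
direct, or outer or Kronecker product

$$(\phi \times \theta)_{i\alpha} = \phi_i\theta_\alpha = \begin{pmatrix} \phi_1\theta_1 \\ \phi_1\theta_2 \\ \vdots \\ \phi_2\theta_1 \\ \phi_2\theta_2 \\ \vdots \end{pmatrix}, \tag{2.87}$$

which is an (mn x 1) matrix. More generally, if A is m x m and B is n x n then
the direct product is

$$(A \times B)_{i\alpha, j\beta} = (A_{ij}B_{\alpha\beta}) = \begin{pmatrix} a_{11}B & a_{12}B & \cdots & a_{1m}B \\ a_{21}B & \cdots & & \\ \vdots & & \ddots & \\ a_{m1}B & \cdots & & a_{mm}B \end{pmatrix}, \tag{2.88}$$

which is an (mn x mn) matrix.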
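In NumPy the direct product is available as `np.kron`, which uses the same block layout (the second factor's index runs fastest). The short sketch below is illustrative; the sample vectors and matrices are arbitrary choices, not from the text.

```python
import numpy as np

phi = np.array([1.0, 2.0])          # (m x 1) column with m = 2
theta = np.array([3.0, 4.0, 5.0])   # (n x 1) column with n = 3

# (phi x theta)_{i alpha} = phi_i theta_alpha, flattened with alpha running fastest
outer = np.kron(phi, theta)

A = np.array([[1.0, 2.0], [3.0, 4.0]])
B = np.array([[0.0, 1.0, 0.0], [1.0, 0.0, 2.0], [0.0, 2.0, 1.0]])
AxB = np.kron(A, B)                 # block (i, j) of the result is A[i, j] * B

# Acting with (A x B) on (phi x theta) is the same as acting with each factor separately
acted = AxB @ outer
separately = np.kron(A @ phi, B @ theta)
```

The last two lines illustrate why the construction is the right one for independent systems: each operator acts only on its own factor of the product state.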


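$\tfrac12 \otimes \tfrac12 = 1 \oplus 0$ in Matrix Representation


Recall

$$|\phi\rangle \Rightarrow \begin{pmatrix} \phi_+ \\ \phi_- \end{pmatrix} \quad\text{and}\quad |\theta\rangle \Rightarrow \begin{pmatrix} \theta_+ \\ \theta_- \end{pmatrix}. \tag{2.89}$$

Then

$$|\phi\rangle|\theta\rangle \Rightarrow \begin{pmatrix} \phi_+\theta_+ \\ \phi_+\theta_- \\ \phi_-\theta_+ \\ \phi_-\theta_- \end{pmatrix} \tag{2.90}$$

and

$$1 \times j_i^B = \begin{pmatrix} j_i^B & 0 \\ 0 & j_i^B \end{pmatrix}, \quad\text{in which each } j_i^B \text{ acts on } |\theta\rangle. \tag{2.91}$$

Then you have the more complicated looking structure

$$j_i^A \times 1 = \begin{pmatrix} (j_i^A)_{11} & 0 & (j_i^A)_{12} & 0 \\ 0 & (j_i^A)_{11} & 0 & (j_i^A)_{12} \\ (j_i^A)_{21} & 0 & (j_i^A)_{22} & 0 \\ 0 & (j_i^A)_{21} & 0 & (j_i^A)_{22} \end{pmatrix} \tag{2.92}$$

leading to:

$$J_1 = \frac12\begin{pmatrix} 0 & 1 & 1 & 0 \\ 1 & 0 & 0 & 1 \\ 1 & 0 & 0 & 1 \\ 0 & 1 & 1 & 0 \end{pmatrix}, \qquad J_2 = \frac12\begin{pmatrix} 0 & -i & -i & 0 \\ i & 0 & 0 & -i \\ i & 0 & 0 & -i \\ 0 & i & i & 0 \end{pmatrix} \tag{2.93}$$

$$J_3 = \begin{pmatrix} 1 & & & \\ & 0 & & \\ & & 0 & \\ & & & -1 \end{pmatrix}, \qquad J^2 = \begin{pmatrix} 2 & 0 & 0 & 0 \\ 0 & 1 & 1 & 0 \\ 0 & 1 & 1 & 0 \\ 0 & 0 & 0 & 2 \end{pmatrix}. \tag{2.94}$$

Notice that $J^2$ is not diagonal in this $|m_A\rangle|m_B\rangle$ basis, of course.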
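Checks

1.

$$|1, 1\rangle = |\tfrac12, \tfrac12\rangle \otimes |\tfrac12, \tfrac12\rangle \quad\text{implies}\quad \begin{pmatrix} 1 \\ 0 \end{pmatrix} \otimes \begin{pmatrix} 1 \\ 0 \end{pmatrix} = \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix}. \tag{2.95}$$

Clearly $J^2$ and $J_3$ have the required action.

2. The Clebsch-Gordan tables give

$$|1, 0\rangle = \frac{1}{\sqrt2}|\tfrac12, \tfrac12\rangle|\tfrac12, -\tfrac12\rangle + \frac{1}{\sqrt2}|\tfrac12, -\tfrac12\rangle|\tfrac12, \tfrac12\rangle. \tag{2.96}$$

Therefore it is represented by $\frac{1}{\sqrt2}(0, 1, 1, 0)^T$. Again $J^2$ and $J_3$ check out.

3. You may observe that

$$J_+ = \begin{pmatrix} 0 & 1 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \end{pmatrix} \tag{2.97}$$

and

$$J_- = \begin{pmatrix} 0 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 1 & 1 & 0 \end{pmatrix} \tag{2.98}$$

have the appropriate effect.

4. The Clebsch-Gordan tables imply

$$|0, 0\rangle = \frac{1}{\sqrt2}|\tfrac12, \tfrac12\rangle|\tfrac12, -\tfrac12\rangle - \frac{1}{\sqrt2}|\tfrac12, -\tfrac12\rangle|\tfrac12, \tfrac12\rangle. \tag{2.99}$$

Therefore it is represented by $\frac{1}{\sqrt2}(0, 1, -1, 0)^T$. Again $J^2$ and $J_3$ check out.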
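Change of Basis


(Go from $m_A, m_B$ to $J, M$.)


1. $\langle JM|\psi\rangle = \langle JM|m_Am_B\rangle\langle m_Am_B|\psi\rangle$. (Use $\sum|\;\rangle\langle\;| = 1$.) (Note:
The change of basis is unitary, $\langle JM|m_Am_B\rangle\langle m_Am_B|J'M'\rangle = \delta_{JJ'}\delta_{MM'}$, i.e., $UU^\dagger = 1$.)

$$\begin{pmatrix} \psi_1 \\ \psi_2 \\ \psi_3 \\ \psi_4 \end{pmatrix} = \begin{pmatrix} (\phi\theta)_1 \\ \frac{1}{\sqrt2}[(\phi\theta)_2 + (\phi\theta)_3] \\ (\phi\theta)_4 \\ \frac{1}{\sqrt2}[(\phi\theta)_2 - (\phi\theta)_3] \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & \frac{1}{\sqrt2} & \frac{1}{\sqrt2} & 0 \\ 0 & 0 & 0 & 1 \\ 0 & \frac{1}{\sqrt2} & -\frac{1}{\sqrt2} & 0 \end{pmatrix}\begin{pmatrix} (\phi\theta)_1 \\ (\phi\theta)_2 \\ (\phi\theta)_3 \\ (\phi\theta)_4 \end{pmatrix} \tag{2.100}$$

We have picked the label ordering so that

$$|1, 1\rangle \Rightarrow \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix}; \quad |1, 0\rangle \Rightarrow \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix}; \quad |1, -1\rangle \Rightarrow \begin{pmatrix} 0 \\ 0 \\ 1 \\ 0 \end{pmatrix}; \quad |0, 0\rangle \Rightarrow \begin{pmatrix} 0 \\ 0 \\ 0 \\ 1 \end{pmatrix}.$$


2. $\langle JM|\mathrm{Op}|J'M'\rangle = \langle JM|m_Am_B\rangle\langle m_Am_B|\mathrm{Op}|m'_Am'_B\rangle\langle m'_Am'_B|J'M'\rangle$, therefore

$$J^2 \Rightarrow \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & \frac{1}{\sqrt2} & \frac{1}{\sqrt2} & 0 \\ 0 & 0 & 0 & 1 \\ 0 & \frac{1}{\sqrt2} & -\frac{1}{\sqrt2} & 0 \end{pmatrix}\begin{pmatrix} 2 & 0 & 0 & 0 \\ 0 & 1 & 1 & 0 \\ 0 & 1 & 1 & 0 \\ 0 & 0 & 0 & 2 \end{pmatrix}\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & \frac{1}{\sqrt2} & 0 & \frac{1}{\sqrt2} \\ 0 & \frac{1}{\sqrt2} & 0 & -\frac{1}{\sqrt2} \\ 0 & 0 & 1 & 0 \end{pmatrix} = \begin{pmatrix} 2 & 0 & 0 & 0 \\ 0 & 2 & 0 & 0 \\ 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}.$$


Exercise

Do $J_+$ and $J_-$:

$$J_+ = \sqrt2\begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}, \qquad J_- = \sqrt2\begin{pmatrix} 0 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}$$

$$J_1 = \frac{1}{\sqrt2}\begin{pmatrix} 0 & 1 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}, \qquad J_2 = \frac{i}{\sqrt2}\begin{pmatrix} 0 & -1 & 0 & 0 \\ 1 & 0 & -1 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}$$


Clearly you now have the direct sum $J^{(1)} \oplus J^{(0)}$.


Note: Because 2 + 2 = 2 x 2 there is a standard error in this problem. One
is tempted to start with the direct sum

$$\begin{pmatrix} j_i^A & 0 \\ 0 & j_i^B \end{pmatrix} \quad\text{acting on}\quad \begin{pmatrix} \phi \\ \theta \end{pmatrix}$$

but this is not what is meant by the angular momentum of the system.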
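The whole change of basis for two spin-1/2 systems can be reproduced in a few lines. The sketch below uses NumPy and is illustrative only: the variable names are assumptions, and the rows of `U` are taken in the order |1,1>, |1,0>, |1,-1>, |0,0>.

```python
import numpy as np

s1 = np.array([[0, 1], [1, 0]], dtype=complex)
s2 = np.array([[0, -1j], [1j, 0]])
s3 = np.array([[1, 0], [0, -1]], dtype=complex)
I2 = np.eye(2)

# J_i = (sigma_i/2 x 1) + (1 x sigma_i/2) in the |mA>|mB> product basis
J = [0.5 * (np.kron(s, I2) + np.kron(I2, s)) for s in (s1, s2, s3)]
Jsq = sum(Ji @ Ji for Ji in J)

r = 1 / np.sqrt(2)
# Rows are |1,1>, |1,0>, |1,-1>, |0,0> expressed in the product basis
U = np.array([
    [1, 0, 0, 0],
    [0, r, r, 0],
    [0, 0, 0, 1],
    [0, r, -r, 0],
])

# Operators in the |J M> basis: Op -> U Op U^dagger
Jsq_new = U @ Jsq @ U.conj().T
J3_new = U @ J[2] @ U.conj().T
Jplus_new = U @ (J[0] + 1j * J[1]) @ U.conj().T
```

In the new basis every operator is block diagonal, a 3 x 3 block for J = 1 and a 1 x 1 block of zeros for J = 0, which is the direct sum structure described above.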


References 


I found it impossible to find correct and appropriate references for this material and 
advise readers to follow the instructions in the next paragraph. 

The coefficients are called Clebsch-Gordan (or vector addition) coefficients [1]. 
You can get tables of them. (Write to CERN Scientific Information Services, CH-1211, 
Geneva 23, Switzerland, and ask for a copy of the Particle Physics Booklet of the current 
academic year. It is free and there is a new one each year.) 


1. Particle Physics Booklet p. 266. 


Problems 


2.1 Write out all the components of $\delta_{ij}$.
2.2 Show that $\delta_{ij}A_j = A_i$.

2.3 Show that $\delta_{ii} = 3$.

24 Show that cj E 
2.5 Show that $\sigma_i\sigma_j = \delta_{ij}1 + i\varepsilon_{ijk}\sigma_k$.

2.6 Trace $\sigma_i = 0$.

2.7 Show that Det $\sigma_i = -1$.

2.8 Show that $(\sigma_i)^a{}_b(\sigma_i)^c{}_d - 2\delta^a_d\delta^c_b + \delta^a_b\delta^c_d = 0$.

2.9 Carefully specifying your notation, work out spin 1 in matrix form.

2.10 Convince yourself that

$$\varepsilon_{ijk}\varepsilon^{lmn} = \delta_i^l\delta_j^m\delta_k^n - \delta_i^l\delta_j^n\delta_k^m + \delta_i^m\delta_j^n\delta_k^l - \delta_i^m\delta_j^l\delta_k^n + \delta_i^n\delta_j^l\delta_k^m - \delta_i^n\delta_j^m\delta_k^l.$$

Hint: At this time the raising or lowering of an index has no meaning.
These positions are used for clarity. Start by counting the indices
on the left-hand side of this equation (you should get 6). Now
ask how many indices and $\delta$'s you need to match (you should get
3 x 2 = 6). Now ask how many indices are on the left-hand side
of this equation, and recall that $\varepsilon_{ijk}$ is antisymmetric. Note that the
lower indices on the right-hand side of this equation match those
on the left-hand side. Note that the numbers of the left-hand column
are fixed to match these and that the upper indices start by
matching those on the left-hand side but then are cycled down the
left-hand column, but a pair (here m and n) are switched in the top
of the right-hand column and then cycled.

2.11 Show that $\delta_i^j\delta_j^m = \delta_i^m$.

2.12 Now deduce that
$\varepsilon_{ijk}\varepsilon^{imn} = \delta_j^m\delta_k^n - \delta_j^n\delta_k^m$
and $\varepsilon_{ijk}\varepsilon^{ijn} = 2!\,\delta_k^n$
and $\varepsilon_{ijk}\varepsilon^{ijk} = 3!$.
Keen students extend to N dimensions and worry about slight
changes in signs of metric components.

2.13 Work out gradient, divergence, and curl where Gradient $\phi = \nabla_i\phi$,
Divergence $V = \nabla_iV_i$, and (Curl $V)_i = \varepsilon_{ijk}\nabla_jV_k$.

2.14 Show A— a A0 where A is antisymmetric.

2.15 Show A; = AXI Aa».

2.16 Show $A = \tfrac12\sigma_i\mathrm{Tr}(A\sigma_i)$.

2.17 Show $A^a{}_b = \tfrac12(\sigma_i)^a{}_b\,A^d{}_c\,(\sigma_i)^c{}_d$.

2.18 Taking a complete set of observables as $\{j_3^A, (j^A)^2, j_3^B, (j^B)^2\}$ show
that the states in this basis have the outer product form.

2.19 Show that when $|j_Aj_Bm_Am_B\rangle = |j_Am_A\rangle|j_Bm_B\rangle$ then the action
of the operators is $j^A \to j^A \times 1$ and $j^B \to 1 \times j^B$.

2.20 When the total angular momentum is defined by $J_i = j_i^A + j_i^B$, show
that the commutator $[J_i, J_j]$ has the usual form.
2.21 Drawing the straight line figure for M running from $-(j_A + j_B)$
through $-j_A$ and $-j_B$ and zero, and $j_B$ and $j_A$ to $(j_A + j_B)$ show
that the number of times you get $j_A + j_B$ is just once for M, but twice
for $j_A + j_B - 1$ and so forth until you reach $M = j_A - j_B$.

2.22 Show that the number of times you get M is given by $n(M) =
j_A + j_B + 1 - |M|$, if $j_A + j_B \geq |M| \geq |j_A - j_B|$.

2.23 Show that now you finally reach $(2j_B + 1)$ when you have used all
$(2j_B + 1)$ values of $m_B$ and moving further left you find a sufficiently
negative $m_B$ to make $m_A = j_A$.

2.24 Show that you now have an angular momentum of the form $n(M) =
2j_B + 1$ if $|M| \leq |j_A - j_B|$.

2.25 Finally show that N(J) = 1 for $J = j_A + j_B,\ j_A + j_B - 1,\ \ldots,\ |j_A - j_B|$.
Hint: The changes of n(M) are what really count.

2.26 Consider the case $1 \otimes \tfrac12 = \tfrac32 \oplus \tfrac12$.
Show that the stretched state is
$|\tfrac32, \tfrac32\rangle = |1, 1\rangle|\tfrac12, \tfrac12\rangle$.

2.27 Apply the operator $J_-$ to show that
$|\tfrac32, \tfrac12\rangle = \sqrt{\tfrac13}\,|1, 1\rangle|\tfrac12, -\tfrac12\rangle + \sqrt{\tfrac23}\,|1, 0\rangle|\tfrac12, \tfrac12\rangle$.

2.28 Find $|\tfrac12, \tfrac12\rangle$ by writing down a linear form with two parameters
and applying normalization and orthogonality. You may assume
the Condon and Shortley phase conventions that if $j_A \geq j_B$ then
$\{\langle j_Aj_A|\langle j_B, J - j_A|\}|JJ\rangle$ is real and > 0.

2.29 Finally show that
$|\tfrac12, \tfrac12\rangle = \sqrt{\tfrac23}\,|1, 1\rangle|\tfrac12, -\tfrac12\rangle - \sqrt{\tfrac13}\,|1, 0\rangle|\tfrac12, \tfrac12\rangle$.

2.30 Work out $\tfrac12 \otimes \tfrac12$ in full, construct the table, then check against the
CERN booklet mentioned in the References section.

2.31 Show $A \times B \neq B \times A$ in general.

2.32 Show $(A + B) \times C = A \times C + B \times C$.

2.33 If A and A' are both (m x m) and B and B' are both (n x n), show
that $(A \times B)(A' \times B') = (AA') \times (BB')$.

2.34 Show $\mathrm{Tr}(A \times B) = \mathrm{Tr}\,A \cdot \mathrm{Tr}\,B$.

2.35 If $A^\dagger = A^{-1}$ and $B^\dagger = B^{-1}$, then show $(A \times B)^\dagger = (A \times B)^{-1}$.

2.36 Work through in detail $\tfrac12 \otimes \tfrac12 = 1 \oplus 0$ in matrix form to check that
it gives what you expect.
Tensors and Tensor Operators


Think about a collection of points in a space. Usually the ordinary points
of our three-dimensional space are what we shall need, but sometimes the
"points" might be the states of a quantum mechanical Hilbert space. (You
might have to deal with a manifold, hyperspace, variety, etc.) Give the points
labels so that you can find them again. Say we use $x^i$ where i takes values in
the appropriate range, for example, i = 1, 2, 3. Then move the points of the
space (I take the active viewpoint), and the point that had labels $x^i$ now has
$x'^i$ as labels. For the physics we have in mind, we assume that the functions
giving the new coordinates $x'^r$ in terms of the old $x^r$,

$$x'^r = f^r(x^1, x^2, \ldots, x^N) \qquad (r = 1, \ldots, N) \tag{3.1}$$

are continuous and differentiable (real, if the coordinates are real), and the
functional determinant (Jacobian)

$$\left|\frac{\partial x'}{\partial x}\right| = \begin{vmatrix} \dfrac{\partial x'^1}{\partial x^1} & \cdots & \dfrac{\partial x'^1}{\partial x^N} \\ \vdots & & \vdots \\ \dfrac{\partial x'^N}{\partial x^1} & \cdots & \dfrac{\partial x'^N}{\partial x^N} \end{vmatrix} \tag{3.2}$$

never vanishes. (Of course, you put up with singularities such as that at the
origin in the change between Cartesian and spherical polar coordinates.) In
principle you can then solve to get

$$x^r = g^r(x'^1, x'^2, \ldots, x'^N) \qquad (r = 1, 2, \ldots, N) \tag{3.3}$$

and this is guaranteed, at least locally, by the implicit function theorem.

Now what are the objects that are of interest to a physicist? Those things 
that have simple properties under the transformations. We will list some of 
the more important ones. 


Scalars 


Often called invariants, scalars are numbers, or more importantly results of 
measurements, that are numerically the same for the new system measured 
with the old apparatus. 
FIGURE 3.1
The temperature plotted against position.


Scalar Fields 


Sometimes called scalar functions, scalar fields are a set of numbers (one at each
point) that "go round with the system." Examples are physical functions of
space on a body, for example, temperature. We describe this by a single (no
index) function of the coordinates, $\phi(x^1, \ldots, x^N)$, but this is not a definite
(fixed) mathematical function even though we do not bother to change the
symbol $\phi$. Suppose, for example, you heat a bar uniformly to 1 degree to the
right of 1 meter from the origin, so the temperature function is given by

$$\phi(x) = \theta(x - 1). \tag{3.4}$$

(See Figure 3.1.)
Now transform the system a distance d to the right. For each point $x \to
x' = x + d$. The plot of temperature now looks like

$$\phi'(x') = \phi'(x + d) = \phi(x) = \theta(x - 1) \tag{3.5}$$

of course. As a function we now have

$$\phi'(x) = \phi(x - d) = \theta(x - 1 - d), \tag{3.6}$$

which you can see in Figure 3.2. In general,

$$\phi(x) \to \phi'(x') = \phi(x) \tag{3.7}$$

and if

$$x \to x' = Tx, \quad\text{then}\quad \phi'(x) = \phi(T^{-1}x). \tag{3.8}$$

(The physical function is invariant; the mathematical one changes.)
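The distinction between the physical and the mathematical function can be made concrete with the heated bar. The following plain-Python sketch is illustrative; the function names and the shift d = 0.5 are assumptions, not from the text.

```python
def theta(x):
    """Unit step: 0 for x < 0, 1 for x >= 0 (the value at 0 is a convention)."""
    return 1.0 if x >= 0 else 0.0

def phi(x):
    """Temperature profile: the bar is heated to the right of x = 1."""
    return theta(x - 1.0)

def translate(x, d):
    """Active translation T: x -> x' = x + d."""
    return x + d

def translate_inverse(x, d):
    """Inverse translation: x -> x - d."""
    return x - d

def transformed_field(field, d):
    """Return the new mathematical function phi'(x) = phi(T^{-1} x)."""
    return lambda x: field(translate_inverse(x, d))

d = 0.5
phi_prime = transformed_field(phi, d)

# The temperature carried by a material point is unchanged: phi'(x') = phi(x)
samples = [0.0, 0.75, 1.25, 2.0]
carried = [phi_prime(translate(x, d)) == phi(x) for x in samples]
```

As a function of a fixed position, however, `phi_prime` now switches on at x = 1 + d rather than at x = 1.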


Invariant Functions 


Invariant functions (e.g., $r^2 = x^2 + y^2 + z^2$ under [3] rotations) have $f(x') = f(x)$
for all allowable points $x'$ and $x$. This can be thought of as a form invariant
scalar field. Such objects might be the wave functions of states that are invariant
under rotations.

FIGURE 3.2
The temperature plotted against position translated.


Contravariant Vectors (t — Index at Top) 


Two neighboring points define a vector PQ at P, which has coordinates $dx^r$
(see Figure 3.3).
If we transform, then

$$dx'^r = \frac{\partial x'^r}{\partial x^s}\,dx^s \tag{3.9}$$

where we notice that $\partial x'^r/\partial x^s$ is fixed (for each given point P) by the transformation;
it does not depend on Q, that is, it does not depend upon the vector $dx$.
This transformation is linear and homogeneous (affine) in the $dx^r$. More generally,
we define contravariant vectors by sets of components $B^j$ (associated
with a point P) that transform like

$$B^i \to B'^i = \frac{\partial x'^i}{\partial x^j}\,B^j \equiv B^j(D^{-1})_j{}^i \tag{3.10}$$

FIGURE 3.3
Two neighboring points define a vector PQ at P.

where the $B^j$ and the $(D^{-1})_j{}^i$ are evaluated at P if it matters. This last (ludicrous)
definition makes sense when you go on to introduce covariant vectors.


Covariant Vectors (Co — Goes Below) 


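Covariant vectors have lower indices and a rule:

$$A_i \to A'_i = \frac{\partial x^j}{\partial x'^i}\,A_j \equiv D_i{}^j A_j \tag{3.11}$$

where D is the inverse of the matrix $\partial x'^i/\partial x^j$, as follows from the chain rule of differentiation:

$$\frac{\partial x'^i}{\partial x^j}\,\frac{\partial x^j}{\partial x'^k} = \frac{\partial x'^i}{\partial x'^k} = \delta^i_k. \tag{3.12}$$

An example (usually the prototype) of a covariant vector is provided by the
gradient of a scalar

$$\frac{\partial\phi}{\partial x^r} \equiv \partial_r\phi \tag{3.13}$$

since

$$\frac{\partial\phi'(x')}{\partial x'^r} = \frac{\partial\phi(x)}{\partial x'^r} = \frac{\partial\phi(x)}{\partial x^s}\,\frac{\partial x^s}{\partial x'^r}. \tag{3.14}$$

You usually deduce this from the remark that the difference between two
scalars at nearby points

$$\phi(x + dx) - \phi(x) \tag{3.15}$$

had clearly better be invariant. Thus in the limit $(\partial_r\phi)\,dx^r$ has to be invariant,
and hence

$$\partial'_r\phi' = \frac{\partial x^s}{\partial x'^r}\,\partial_s\phi \tag{3.16}$$

has to follow.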


Notes 


1. In general the $x^r$ themselves are not components of a vector at all.

2. In general D depends on the point x, often in a horrible nonlinear 
way. The presentation given is now viewed as old fashioned by math- 
ematicians but it serves our purpose very well. 
Tensors


The previous ideas extend to tensors of any rank (number of indices) of mixed
contravariant and covariant types:

$$T'^{ab\ldots}{}_{cd\ldots} = T^{pq\ldots}{}_{rs\ldots}\,(D^{-1})_p{}^a (D^{-1})_q{}^b \cdots D_c{}^r D_d{}^s \cdots \tag{3.17}$$


Notes and Properties


1. Scalars are zero index tensors, and contra(co)variant vectors are single
   index tensors.

2. Because the transformation laws are linear in the tensor components,
   you can add or subtract constant multiples of identical tensor types.

   $$C^{ab}{}_{c} = A^{ab}{}_{c} + 7B^{ab}{}_{c} \tag{3.18}$$

3. You can form outer products, which get the appropriate transformations,

   $$V^a W_b = T^a{}_b. \tag{3.19}$$

4. Just as $(\partial_r \phi)\, dx^r$ was a scalar, there is an extension to inner products
   by contraction of contravariant against covariant indices,

   $$T_{ab} A^b = B_a. \tag{3.20}$$

5. Symmetry holds after transformation,

   $$T^{ab} = \pm T^{ba}. \tag{3.21}$$

6. $\delta^i_j$ is a mixed tensor.

7. Theorem: Given some objects $S^{ab\ldots}$, and that $S^{ab\ldots} T_{ab\ldots}$ is invariant for all
   $T_{ab\ldots}$, then the $S$ form a tensor.

   Proof: Suppose that $S \to \bar{S}$ instead of $S'$ under a transformation.
   Both $S'T'$ and $\bar{S}T'$ are equal to $ST$. Hence, $(\bar{S} - S')T' = 0$, but the $T'$ are
   arbitrary.

8. Reduction of tensors is performed by symmetrization (or antisymmetrization) of all index pairs of the same type and extraction of all $\delta^i_j$
   pieces by contraction, until an irreducible tensor is reached. (If there
   is an $\epsilon_{ijk}$ tensor [see later] this must be taken out too. Also [shortly]
   we shall learn how $\delta_{ij}$ and $\delta^{ij}$ exist for rotations; then these must be
   pulled out. In general, any invariant [numerical] tensor must be used
   if it exists.)


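   (a) $$T^{ij} = S^{ij} + A^{ij} \quad\text{where} \tag{3.22}$$

   $$S^{ij} = \tfrac{1}{2}\left(T^{ij} + T^{ji}\right) \quad\text{and} \tag{3.23}$$

   $$A^{ij} = \tfrac{1}{2}\left(T^{ij} - T^{ji}\right). \tag{3.24}$$

   (b) $$M_i{}^j = \bar{M}_i{}^j + \tfrac{1}{n}\,\delta_i^j\, M_k{}^k \tag{3.25}$$

   where $n$ is the dimension of the space ($n = 3$ here) and $\bar{M}$ has no trace,

   $$\bar{M}_k{}^k = 0. \tag{3.26}$$

   Notice that, in (a), $A$ has three components and can be related to a
   covariant vector by

   $$V_i = \tfrac{1}{2}\,\epsilon_{ijk} A^{jk} \tag{3.27}$$
   $$A^{jk} = \epsilon^{ijk} V_i \tag{3.28}$$

   if you like. In all examples, practice and ingenuity are important. Try
   some.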
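
As a way of trying this out, here is a minimal sketch of the reduction in (a) for a rank-two tensor in three dimensions, including the dual vector of Equations (3.27) and (3.28). The sample components of `T` are arbitrary illustrative numbers, and index values run over 0, 1, 2 rather than 1, 2, 3.

```python
def levi_civita(i, j, k):
    """epsilon_{ijk} for indices drawn from {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2


n = 3
# Arbitrary sample tensor components T^{ij} (illustrative values only).
T = [[1.0, 2.0, 3.0],
     [4.0, 5.0, 6.0],
     [7.0, 8.0, 10.0]]

# Symmetric and antisymmetric parts, Equations (3.23) and (3.24).
S = [[0.5 * (T[i][j] + T[j][i]) for j in range(n)] for i in range(n)]
A = [[0.5 * (T[i][j] - T[j][i]) for j in range(n)] for i in range(n)]

# Dual vector V_i = (1/2) epsilon_{ijk} A^{jk}, Equation (3.27).
V = [0.5 * sum(levi_civita(i, j, k) * A[j][k]
               for j in range(n) for k in range(n))
     for i in range(n)]

# Recover A^{jk} = epsilon^{ijk} V_i, Equation (3.28).
A_back = [[sum(levi_civita(i, j, k) * V[i] for i in range(n))
           for k in range(n)]
          for j in range(n)]
```

The three independent entries of `A` are exactly the three components of `V`, which is why the antisymmetric part of a rank-two tensor can stand in for a vector in three dimensions.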


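9. Repeated transformations (transitivity). Shift from $x$ to $x'$, then $x'$
   to $x''$.

   $$V'^r = \frac{\partial x'^r}{\partial x^s}\, V^s \tag{3.29}$$

   then

   $$V'^r \to V''^r = \frac{\partial x''^r}{\partial x'^s}\, V'^s \tag{3.30}$$
   $$= \frac{\partial x''^r}{\partial x'^s}\frac{\partial x'^s}{\partial x^t}\, V^t \tag{3.31}$$
   $$= \frac{\partial x''^r}{\partial x^t}\, V^t. \tag{3.32}$$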


(This obviously always works!) Why is this useful? Now that we have 
established transitivity, since the transformation laws are linear and homoge- 
neous, an equation true before transformation is also true afterward. (From 
a passive viewpoint, an equation true in one frame is true in all others.) So 
such statements are about the physics, not just the way in which the observer 
sees it. 
Rotations
At this point I feel that you have enough of the general definitions to work
on yourself whenever you need, so I shall move on to rotations in Euclidean
space. Then the relationship between the $x^i$ and the $x'^i$ is linear, real, and
preserves distances. (We also stick to right-handed axes.) The linearity means
that we write

$$x'^i = x^j (M^{-1})_j{}^i \tag{3.33}$$
$$x^i = x'^j M_j{}^i \tag{3.34}$$

with $M$ a matrix of numbers (i.e., independent of $x$). Obviously,

$$\frac{\partial x^j}{\partial x'^i} = M_i{}^j \tag{3.35}$$

so that $D_i{}^j$ is identical to $M_i{}^j$ in this case. The $x^i$ are the components of a
contravariant vector for rotations. The reality of the transformations means
that $D$ has real elements. (You can also show that sticking to right-handed
coordinates means that the determinant of $D$ can be fixed to be unity.)
The length (Euclidean) of a vector is defined by the sum of squares of its
components, of course, so we have

$$x'^i x'^i = x^i x^i \quad\text{for all } x. \tag{3.36}$$

In matrix language

$$M M^T = 1 \tag{3.37}$$

and the matrix is orthogonal. Now that is handy. In matrix language you had

$$A \to DA \tag{3.38}$$
$$B \to (D^{-1})^T B \tag{3.39}$$

but $D = M$, so that

$$A \to MA \tag{3.40}$$
$$B \to MB \tag{3.41}$$

and for rotations covariant and contravariant are equivalent. We shall just
speak of vectors. You can put the vector or tensor indices up or down at will.
The numerically invariant tensors $\delta^i_j$ and $\epsilon_{ijk}$ are now available as $\delta_{ij}$, $\delta^{ij}$, $\epsilon^{ijk}$,
and so forth.

Now we return to the study of interesting objects with simple transforma- 
tion laws. 
Vector Fields


Vector fields are physical functions of space (and time) that are vector valued.
A set of (three) numbers at every point of space are components of a vector at
that point and "go round with the system" (e.g., the velocities at all points of
a fluid at one time). To make this more precise, we introduce more notation
for rotations in our [3] space. Take a set of axes (orthonormal) $\mathbf{e}_i$; specify a
rotation by a set of (orthonormal) vectors $\mathbf{n}_i$, which you get by rotating vectors
originally coincident with the $\mathbf{e}_i$. (The $\mathbf{e}_i$ are fixed for all time.)
(Note: The $i$ on $\mathbf{e}_i$ or $\mathbf{n}_i$ is a label.)
Then

$$\mathbf{n}_i = \mathbf{e}_j R_{ji}. \tag{3.42}$$

Clearly

$$R_{ji} = \mathbf{e}_j \cdot \mathbf{n}_i \tag{3.43}$$

and if you wish to specify the rotation in detail you would probably give the
axis of rotation and the right-hand-thread rotation around it, $R(\mathbf{n}, \theta)$, or
you might use Euler angles.


Let us connect this with our previous ideas about vectors. The components
of vectors relate to the labels on the fixed basis vectors by

$$\mathbf{V} = V_i \mathbf{e}_i \tag{3.44}$$
$$V_i = \mathbf{V} \cdot \mathbf{e}_i. \tag{3.45}$$

Now, a vector "goes round with the system" so that

$$V'_i \mathbf{e}_i = \mathbf{V}' = R(\mathbf{V}) = V_i \mathbf{n}_i = V_i \mathbf{e}_j R_{ji} = (R_{ji} V_i)\, \mathbf{e}_j, \tag{3.46}$$

therefore

$$V'_i = R_{ij} V_j. \tag{3.47}$$

Good! $R_{ij} = M_{ij} = D_{ij}$. And, of course, all of this extends to tensors. (We are
not being deliberately awkward in having three "matrices" $(R, M, D)$; when you
wish to read on, it is important to know that they are logically distinct
but coincide in the present case.)

Now to return to vector fields. The precise form of the transformation is
given by:

$$\mathbf{V}(x) = V_i(x)\, \mathbf{e}_i \tag{3.48}$$
$$V_i(x) = \mathbf{V}(x) \cdot \mathbf{e}_i \tag{3.49}$$

so that

$$\mathbf{e}_i V'_i(x) = \mathbf{V}'(x) = R\,\mathbf{V}(R^{-1}x) \tag{3.50}$$

as above.
(In words: The new vector at $x$ is the rotation of the old vector at the point
that moves to $x$.) Therefore

$$V'_i(x) = R_{ij} V_j(R^{-1}x) \tag{3.51}$$
$$\{\text{Scalar field } \psi'(x) = \psi(R^{-1}x)\}. \tag{3.52}$$

(In general you distinguish covariance and contravariance.)

It is probably worth reiterating that although the physical function is unchanged ($\psi'(x') = \psi(x)$) or merely has its components mixed ($V'_i(x') =
R_{ij} V_j(x)$) we speak of the change (mathematical) of the objects as

$$\psi \to \psi' \quad\text{with}\quad \psi'(x) = \psi(R^{-1}x) \tag{3.53}$$
$$V_i \to V'_i \quad\text{with}\quad V'_i(x) = R_{ij} V_j(R^{-1}x) \tag{3.54}$$

and you have to keep this in mind. (But physically $\psi(x) \to \psi'(x') = \psi(x)$.)
Tensor fields are obvious generalizations of vector fields. Now for a new
class of objects.
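
The two transformation laws, Equations (3.51) and (3.52), can be written down as a short sketch. The rotation is taken about the z-axis and the sample fields `uniform_x` and `height_x` are hypothetical choices made only for illustration.

```python
import math


def rot_z(theta):
    """Rotation matrix R for a right-hand-thread rotation by theta about z."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def apply(R, v):
    """Matrix times column vector: (R v)_i = R_ij v_j."""
    return [sum(R[i][j] * v[j] for j in range(3)) for i in range(3)]


def transpose(R):
    return [[R[j][i] for j in range(3)] for i in range(3)]


def rotate_scalar_field(psi, R):
    """psi'(x) = psi(R^{-1} x); for rotations R^{-1} is the transpose."""
    R_inv = transpose(R)
    return lambda x: psi(apply(R_inv, x))


def rotate_vector_field(V, R):
    """V'_i(x) = R_ij V_j(R^{-1} x)."""
    R_inv = transpose(R)
    return lambda x: apply(R, V(apply(R_inv, x)))


# Hypothetical sample fields.
def uniform_x(x):
    return [1.0, 0.0, 0.0]      # points along e_x everywhere


def height_x(x):
    return x[0]                 # scalar field equal to the x coordinate


R = rot_z(math.pi / 2)
rotated_vector = rotate_vector_field(uniform_x, R)
rotated_scalar = rotate_scalar_field(height_x, R)
```

After a quarter turn about z the uniform field points along the y-axis at every point, and the scalar field that used to read off the x coordinate now reads off the y coordinate.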


Tensor Operators 


(This is still classical, not quantum.) 


Scalar Operator 


A scalar operator is not affected by rotation. If it gives a measurement, then the result is a scalar. If
it is composed of the components of tensors, then it is so cunningly constructed
that it "undoes" their transformations. Compare this with invariant function
and contrast this with scalar field.


Vector Operator 

These objects ($N$ in $N$ dimensions), which are called the components, go
round with the system like vectors would (not like the components of vectors).
Remember that

$$\mathbf{n}_i = \mathbf{e}_j R_{ji} \tag{3.55}$$

for vectors originally coincident with the $\mathbf{e}_i$. So we have

$$K_i \to K'_i = K_j R_{ji} = (R^{-1})_{ij} K_j. \tag{3.56}$$

As an example, recall how our prototype of a covariant vector was

$$\frac{\partial \phi(x)}{\partial x^i}$$

where $\phi(x)$ was a scalar field. We can think of a (fixed) vector operator $\nabla$ such
that

$$\nabla_i = \frac{\partial}{\partial x^i} \quad \left\{\text{or } (\nabla\phi)_i = \frac{\partial \phi}{\partial x^i}\right\}.$$

Now we already know the effect here is to write down the rates of change,
and we can think of writing

$$\nabla = \mathbf{e}^i \nabla_i = \mathbf{n}^i \nabla'_i \quad\text{(note order for safety)} \tag{3.57}$$

so that you envisage a set of three operators that change to compensate the
$\mathbf{e} \to \mathbf{n}$. Clearly then

$$\mathbf{e}^i \nabla_i = \mathbf{n}^i \nabla'_i \tag{3.58}$$
$$= \mathbf{e}^j D_j{}^i\, \nabla'_i, \tag{3.59}$$

therefore

$$\nabla_j = D_j{}^i\, \nabla'_i, \tag{3.60}$$

therefore

$$\nabla'_i = (D^{-1})_i{}^j\, \nabla_j \quad\text{(inverse)}. \tag{3.61}$$

This concept extends to tensor operators of all kinds. Examples abound. (In
quantum mechanics the usual $r_i$, $L_i$, $p_i$ are all vector operators.)


Notes 


1. In general there is no relationship between collections of objects forming tensors and those forming tensor operators.


2. Reduction of tensor operators to irreducible (or spherical) tensor operators follows exactly as it did for tensors.


3. If you compare

   $$K'^i = K^j D_j{}^i \quad\text{and} \tag{3.62}$$
   $$K'_i = (D^{-1})_i{}^j K_j \tag{3.63}$$

   with Equations (3.10) and (3.11), that is,

   $$B'^j = B^i (D^{-1})_i{}^j \quad\text{and} \tag{3.64}$$
   $$A'_i = D_i{}^j A_j, \tag{3.65}$$

   this is said to be cogredient transformation by some authors. Warning:
   Someone taking a passive point of view can "send his axes" through
   $(-\theta)$ to "agree" with us; then the transformation laws for $K_i$ and $A_i$
   coincide. This looks neat but hides the problem.


Connection with Quantum Mechanics 


The states of systems get "moved around" by transformations; you can think
of the states as points. The Dirac description is in terms of the state vectors $|\psi\rangle$
(in Hilbert space), and the components (which might be $\psi_i(x) = \langle \mathbf{e}_i, x|\psi\rangle$,
say) are the corresponding wave functions that you can think of as coordinates
if you like.

Transformations are represented by operators (quantum mechanical this
time) that act on the states to give new ones. Although norms (scalar products)
are preserved, you must remember that complex conjugation is involved:

$$\text{If}\quad |\psi\rangle \to |\psi'\rangle = U|\psi\rangle, \tag{3.66}$$
$$\text{then}\quad \langle\psi'|\psi'\rangle = \langle\psi|\psi\rangle \;\Rightarrow\; U^\dagger U = 1. \tag{3.67}$$

So the operators representing transformations in this space are unitary.


Observables 


(Represented by Hermitian operators.) All dynamic variables have to transform with the states. If you have

$$A|\psi\rangle = |\phi\rangle \tag{3.68}$$
$$A'|\psi'\rangle = |\phi'\rangle \tag{3.69}$$

for arbitrary states, then

$$A'U|\psi\rangle = U|\phi\rangle = UA|\psi\rangle, \tag{3.70}$$

so that

$$A' = UAU^{-1}. \tag{3.71}$$

In particular, if the observable gives a measurement then it must transform so
that the eigenvalue for the rotated state is the same as it was (for the original
observable) on the old state:

$$A|a_n\rangle = a_n|a_n\rangle \tag{3.72}$$
$$UA|a_n\rangle = a_n U|a_n\rangle \tag{3.73}$$
$$(UAU^{-1})\,U|a_n\rangle = a_n U|a_n\rangle, \tag{3.74}$$

therefore

$$A' = UAU^{-1}. \tag{3.75}$$
Rotations 


(Think about ordinary ones in three-dimensional space.) Corresponding to
each rotation, $R$, in ordinary three-dimensional space there is a unitary operator, $U(R)$, which transforms the states. All the "nice" properties are preserved
correctly, so there is no need for most of us to worry about the technical side
of things most of the time.

$$U(1) = 1 \tag{3.76}$$
$$U(R^{-1}) = U^{-1}(R) \tag{3.77}$$
$$U(R_2 R_1) = U(R_2)\,U(R_1) \tag{3.78}$$

with the usual convention that the one written to the right is applied first in
each case.


Scalar Fields 


Scalar fields often appear as quantum mechanical wave functions. It is important to notice that we regard the reference bras $\langle x, \mathbf{e}_i|$ as fixed when we
envisage a rotation, but we can then think of the action of the operators on
them. Thus under a rotation $R$ we get:

$$|\psi\rangle \to |\psi'\rangle = U(R)|\psi\rangle, \tag{3.79}$$
$$\langle x| \to \langle x|, \tag{3.80}$$

and so

$$\psi(x) = \langle x|\psi\rangle \to \psi'(x) = \langle x|\psi'\rangle = \langle x|U|\psi\rangle \tag{3.81}$$

where we then go on to say:

$$U|x\rangle = |Rx\rangle \tag{3.82}$$

but

$$UU^\dagger = 1, \quad\text{that is,}\quad U^\dagger = U^{-1}, \tag{3.83}$$

therefore

$$U^\dagger|x\rangle = |R^{-1}x\rangle, \tag{3.84}$$

so that

$$\langle R^{-1}x| = \langle x|U, \tag{3.85}$$

and hence

$$\psi'(x) = \langle x|U|\psi\rangle = \langle R^{-1}x|\psi\rangle = \psi(R^{-1}x), \tag{3.86}$$

to establish the connection with our scalar field as defined earlier.


Vector Fields 


Vector fields often appear in a similar way. If we are describing a system with
angular momentum, then the wave function will have three components that
form the components of a vector field at each point. (This could be a hydrogen
atom in a state with labels $n = 2$, $l = 1$, $m = (+1, 0, -1)$.) We would have

$$\langle x, \mathbf{e}_i|\psi\rangle = \psi_i(x) \tag{3.87}$$

where we have taken our usual fixed axes in a Cartesian basis. Then, under a
rotation $R$, we get

$$|\psi\rangle \to |\psi'\rangle = U(R)|\psi\rangle \tag{3.88}$$
$$\langle x, \mathbf{e}_i| \to \langle x, \mathbf{e}_i| \tag{3.89}$$
$$\psi_i(x) = \langle x, \mathbf{e}_i|\psi\rangle \to \langle x, \mathbf{e}_i|\psi'\rangle = \psi'_i(x). \tag{3.90}$$

To go on from here you have to be quite specific in handling the reference
states (vectors). Let us do this in full detail just for once:

$$U(R^{-1})\,|x, \mathbf{e}_i\rangle = |R^{-1}(x),\, R^{-1}\mathbf{e}_i\rangle \tag{3.91}$$
$$= |R^{-1}(x),\, \mathbf{e}_j (R^{-1})_{ji}\rangle \tag{3.92}$$
$$= |R^{-1}(x),\, R_{ij}\mathbf{e}_j\rangle \tag{3.93}$$
$$= R_{ij}\,|R^{-1}(x),\, \mathbf{e}_j\rangle. \tag{3.94}$$

That is

$$\langle x, \mathbf{e}_i|\,U(R) = R_{ij}\,\langle R^{-1}(x), \mathbf{e}_j|, \tag{3.95}$$

since $R$ is real and does not get transposed here.
Then

$$\psi'_i(x) = \langle x, \mathbf{e}_i|\psi'\rangle \tag{3.96}$$
$$= \langle x, \mathbf{e}_i|U|\psi\rangle \tag{3.97}$$
$$= R_{ij}\,\langle R^{-1}x, \mathbf{e}_j|\psi\rangle, \tag{3.98}$$

therefore

$$\psi'_i(x) = R_{ij}\,\psi_j(R^{-1}x), \tag{3.99}$$

and we have made the connection with what we called the transformation
law for vector fields earlier.

The extension to see how tensor fields occur as wave functions of states
with high angular momentum should now be obvious.

A form invariant scalar field (invariant function) can appear in the form of
an invariant wave function. If we have a state (such as the ground state of hydrogen with a wave function $\psi(x) \propto \exp(-r)$) that is invariant under rotations,

$$|\psi\rangle \to |\psi'\rangle = U|\psi\rangle = |\psi\rangle, \tag{3.100}$$

then

$$\psi(x) = \langle x|\psi\rangle \to \langle x|\psi'\rangle = \psi'(x), \tag{3.101}$$

and now therefore

$$\psi'(x) = \psi(x) \tag{3.102}$$

and we retrieve form invariance.

Scalar operators (invariant operators) often appear as those which give
measurements which are scalars. These are fixed operators $A' = A$, and since
a general state transforms as $|\psi\rangle \to U|\psi\rangle$, we deduce that

$$[A, U] = 0. \tag{3.103}$$


The generators of the transformations (J; for rotations) all commute with 
scalar operators. (A good example would be the Hamiltonian operator of a 
rotationally invariant system [ground state hydrogen atom]. This measures 
the energy that is unchanged by rotation.) 

Obviously you try to do measurements associated with scalar operators, so 
that, although you never move your fixed observing apparatus, the operator 
remains appropriate since its change (as a dynamical variable) would have 
been no change at all in this case. 

Tensor operators arise when you try to do rotations of a system and insist
that the expectation values of some fixed operators (associated with apparatus
in your fixed reference frame) form the components of a vector (or more
generally, a tensor). That is, you have:

$$\langle\psi|P_i|\psi\rangle \to R_{ij}\,\langle\psi|P_j|\psi\rangle, \tag{3.104}$$

but you know that $|\psi\rangle \to U(R)|\psi\rangle$, so that

$$\langle\psi|P_i|\psi\rangle \to \langle\psi|U^\dagger P_i U|\psi\rangle \tag{3.105}$$

and hence

$$U^\dagger P_i U = R_{ij} P_j \tag{3.106}$$
$$\text{or}\quad U P_i U^\dagger = (R^{-1})_{ij} P_j = P_j R_{ji}. \tag{3.107}$$

If you have observables, or dynamical variables (these are quantum mechanical operators), which coincide with these originally, then they form the components of a vector operator according to our earlier definitions. (Their expectation values between fixed reference states also form a vector operator.)
Examples include the position, momentum, and angular momentum operators.
The extension to tensor operators of arbitrary rank is now straightforward.

Note: You can see how the game is starting to get complicated; you can 
dream up all sorts of weird objects. But the ones I have indicated are the 
most important ones for physics. One last warning: If you do quantum field 
theory then the whole game gets yet one more level of complication. But our 
definitions and so forth are fine if you work with care. 


Specification of Rotations 


There are many ways to specify a rotation, but we will stick to the crude idea
of defining an axis by a unit vector $\mathbf{n}$, and then specifying an angle of rotation
$\theta$ about that axis in the right-hand thread sense [1,2]. Note: (1) Do not confuse
this $\mathbf{n}$ with the moving axes. (2) Read about Euler angles.

For example, if we have $V_i \to R_{ij}V_j$ by an angle $\theta$ around the z-axis, then
because $R = M = D$, and $R^T = R^{-1}$ we know

$$x'^r = x^s (R^{-1})_s{}^r \quad\text{or}\quad x'_r = R_{rs}\, x_s \tag{3.108}$$

$$[x'] = [x]\,R^{-1} \quad\text{or}\quad [x'] = (R^{-1})^T [x] = R[x] \tag{3.109}$$

becomes

$$x' = x\cos\theta - y\sin\theta \tag{3.110}$$

$$y' = y\cos\theta + x\sin\theta \tag{3.111}$$

$$z' = z. \tag{3.112}$$
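Hence, we write

$$R(\mathbf{e}_z, \theta) = \begin{pmatrix} \cos\theta & -\sin\theta & 0 \\ \sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix} \tag{3.113}$$

and

$$\begin{pmatrix} V_x \\ V_y \\ V_z \end{pmatrix} \to R \begin{pmatrix} V_x \\ V_y \\ V_z \end{pmatrix} = \begin{pmatrix} 1 & -\theta & 0 \\ \theta & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} V_x \\ V_y \\ V_z \end{pmatrix} \tag{3.114}$$

$$= \begin{pmatrix} V_x \\ V_y \\ V_z \end{pmatrix} + \theta \begin{pmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} \begin{pmatrix} V_x \\ V_y \\ V_z \end{pmatrix} \tag{3.115}$$

working with infinitesimals. We can write this generally as

$$V_i \to V'_i = V_i + \theta\,\epsilon_{ijk}\, n_j V_k + O(\theta^2) \tag{3.116}$$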
= Vi + ij; Vi (3.117) 

that is,

$$\mathbf{V}' = \mathbf{V} + \theta\, \mathbf{n} \times \mathbf{V} + O(\theta^2). \tag{3.118}$$
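
The first-order law can be compared against the exact rotation about the z-axis. This is a minimal sketch: the vector `V` is an arbitrary illustrative choice, and the error of the first-order form should shrink like $\theta^2$.

```python
import math


def rot_z(theta):
    """Exact rotation matrix about z, Equation (3.113)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def apply(R, v):
    return [sum(R[i][j] * v[j] for j in range(3)) for i in range(3)]


def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def first_order_rotation(n, theta, v):
    """V' = V + theta n x V, dropping O(theta^2) terms (Equation 3.118)."""
    nxv = cross(n, v)
    return [v[i] + theta * nxv[i] for i in range(3)]


V = [1.0, 2.0, 3.0]           # arbitrary illustrative vector
Z_AXIS = [0.0, 0.0, 1.0]


def max_error(theta):
    """Largest component difference between exact and first-order results."""
    exact = apply(rot_z(theta), V)
    approx = first_order_rotation(Z_AXIS, theta, V)
    return max(abs(exact[i] - approx[i]) for i in range(3))
```

Doubling the angle should roughly quadruple `max_error`, which is the numerical signature of the neglected $O(\theta^2)$ term.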


Transformation of Scalar Wave Functions 


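Suppose we have a scalar wave function so that when we transform the system
the mathematical change in the function is given by

$$\psi(x) \to \psi'(x) = \psi(R^{-1}x). \tag{3.119}$$

For our small angle rotation about the z-axis we get

$$\psi'(x) = \psi(x + \theta y,\; y - \theta x,\; z) + \ldots \quad (\text{just } R^{-1}(\theta) = R(-\theta)) \tag{3.120}$$
$$= \psi(x, y, z) + \theta\left(y\frac{\partial\psi}{\partial x} - x\frac{\partial\psi}{\partial y}\right) + \ldots \quad (\text{Taylor expansion}) \tag{3.121}$$
$$= \psi(x, y, z) - \frac{i\theta}{\hbar} L_z\, \psi(x, y, z) + \ldots \tag{3.122}$$

where $L_z$ is the usual orbital angular momentum operator.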
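
The Taylor-expansion step can be checked numerically. In this sketch the test function `psi` is a hypothetical polynomial chosen only for illustration, and the derivatives are taken by central differences rather than analytically.

```python
def psi(x, y, z):
    """Hypothetical smooth test function (illustrative only)."""
    return x * y + x + z * z


def rotated_psi(theta, x, y, z):
    """psi'(x) = psi(x + theta*y, y - theta*x, z), the leading form in (3.120)."""
    return psi(x + theta * y, y - theta * x, z)


def generator_action(x, y, z, h=1e-5):
    """(y d/dx - x d/dy) psi, with the derivatives by central differences."""
    dpsi_dx = (psi(x + h, y, z) - psi(x - h, y, z)) / (2 * h)
    dpsi_dy = (psi(x, y + h, z) - psi(x, y - h, z)) / (2 * h)
    return y * dpsi_dx - x * dpsi_dy


theta = 1e-6
point = (0.3, 0.7, 0.1)

# Change of the function per unit angle, to be compared with Equation (3.121).
finite_change = (rotated_psi(theta, *point) - psi(*point)) / theta
predicted = generator_action(*point)
```

With $L_z = -i\hbar\,(x\,\partial_y - y\,\partial_x)$ the quantity `predicted` is exactly $-(i/\hbar) L_z \psi$ evaluated at the chosen point, so agreement between the two numbers is the content of Equation (3.122).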
We write

$$\psi(x) \to R_z(\theta)\, \psi(x) \tag{3.123}$$

with

$$R_z(\theta) = 1 - \frac{i\theta}{\hbar} L_z + \ldots \tag{3.124}$$

and this extends to

$$R(\mathbf{n}, \theta) = 1 - \frac{i\theta}{\hbar}\, \mathbf{n}\cdot\mathbf{L} + \ldots \tag{3.125}$$

if we rotate about an axis specified by $\mathbf{n}$.
We define

$$R(\mathbf{n}, \theta) = 1 - \frac{i\theta}{\hbar}\, \mathbf{n}\cdot\mathbf{J} + \ldots \tag{3.126}$$

when the system has no classical analogue, that is, for odd-half-integer spins.

Note: At the moment we do not know that all of this is consistent. The result
of two rotations around different axes has to have the appropriate effect as a
rotation. We must check that this ties up with the usual commutation relations
we assigned to components of angular momentum. We shall come to this
shortly.


Finite Angle Rotations 


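We build up $R(\mathbf{n}, \theta)$, now for finite $\theta$, by repeated application of small
rotations around the same axis $\mathbf{n}$.

$$R(\mathbf{n}, \theta + d\theta) = R(\mathbf{n}, d\theta)\, R(\mathbf{n}, \theta) \tag{3.127}$$

$$= \left(1 - \frac{i}{\hbar}\, \mathbf{n}\cdot\mathbf{J}\, d\theta\right) R(\mathbf{n}, \theta), \tag{3.128}$$

therefore

$$\frac{d}{d\theta} R(\mathbf{n}, \theta) = -\frac{i}{\hbar}\, \mathbf{n}\cdot\mathbf{J}\; R(\mathbf{n}, \theta) \tag{3.129}$$

and we can even solve this equation to get

$$R(\mathbf{n}, \theta) = \exp\left(-\frac{i\theta}{\hbar}\, \mathbf{n}\cdot\mathbf{J}\right) \tag{3.130}$$

since

$$R(\mathbf{n}, 0) = 1. \tag{3.131}$$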
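
The build-up of a finite rotation from many small ones can be illustrated in the simplest setting, a rotation in the x-y plane. This is a sketch under the assumption that the real $2 \times 2$ matrix `[[0, -1], [1, 0]]` plays the role of $-(i/\hbar)\,\mathbf{n}\cdot\mathbf{J}$ for the vector components; the function names are illustrative.

```python
import math


def matmul(A, B):
    """Product of two 2 x 2 matrices."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def small_rotation(dtheta):
    """1 + dtheta * G with G = [[0, -1], [1, 0]], valid to first order."""
    return [[1.0, -dtheta], [dtheta, 1.0]]


def compose(theta, steps):
    """Apply the small rotation 'steps' times, as in Equation (3.127)."""
    R = [[1.0, 0.0], [0.0, 1.0]]
    step = small_rotation(theta / steps)
    for _ in range(steps):
        R = matmul(step, R)
    return R


def exact(theta):
    """The closed-form result of exponentiating the generator."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]


def max_diff(A, B):
    return max(abs(A[i][j] - B[i][j]) for i in range(2) for j in range(2))


theta = 1.0
coarse = max_diff(compose(theta, 100), exact(theta))
fine = max_diff(compose(theta, 100000), exact(theta))
```

As the number of steps grows the product converges to the exponential, which is the statement of Equation (3.130) for this simple generator.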
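Consistency with the Angular Momentum Commutation Rules


Consider our rotation $U(R)$ specified by $R(\mathbf{n}, \theta)$, and think about the components $K_i$ of some vector operator. We know that

$$U K_j U^{-1} = K_i R_{ij} = (R^{-1})_{ji} K_i, \tag{3.132}$$

$$\exp\left(-\frac{i\theta}{\hbar}\,\mathbf{n}\cdot\mathbf{J}\right) K_j \exp\left(\frac{i\theta}{\hbar}\,\mathbf{n}\cdot\mathbf{J}\right) = K_i R_{ij} \tag{3.133}$$

so that expanding

$$\left(1 - \frac{i\theta}{\hbar}\,\mathbf{n}\cdot\mathbf{J} + \ldots\right) K_j \left(1 + \frac{i\theta}{\hbar}\,\mathbf{n}\cdot\mathbf{J} + \ldots\right) = K_j - \theta\,\epsilon_{jik}\, n_i K_k + \ldots \tag{3.134}$$

where the last step follows from the law we deduced for $V_i$ with $\theta \to (-)\theta$.
Thus

$$K_j - \frac{i\theta}{\hbar}\, n_i [J_i, K_j] + \ldots = K_j - \epsilon_{jik}\,\theta\, n_i K_k + \ldots \tag{3.135}$$

and we deduce

$$[J_i, K_j] = i\hbar\,\epsilon_{ijk} K_k \tag{3.136}$$

since $\mathbf{n}$ was arbitrary.
Now we can put it all together. The components of the angular momentum
are themselves a vector operator, since the expectation values of them must
transform like the classical angular momentum vector. Putting $K \to J$ we see

$$[J_i, J_j] = i\hbar\,\epsilon_{ijk} J_k \tag{3.137}$$

and we have retrieved our previous commutation relations.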
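
A concrete check of the commutation relations is possible with the $3 \times 3$ matrices that generate rotations of ordinary vectors. The sketch below assumes units with $\hbar = 1$ and takes $(J_i)_{jk} = -i\,\epsilon_{ijk}$, which is the choice that reproduces the infinitesimal law $V'_j = V_j + \theta\,\epsilon_{jik} n_i V_k$; indices run over 0, 1, 2.

```python
def eps(i, j, k):
    """epsilon_{ijk} for indices drawn from {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2


# Generators in the vector representation (assumption: hbar = 1).
J = [[[-1j * eps(i, j, k) for k in range(3)] for j in range(3)]
     for i in range(3)]


def matmul(A, B):
    return [[sum(A[a][c] * B[c][b] for c in range(3)) for b in range(3)]
            for a in range(3)]


def commutator(A, B):
    AB, BA = matmul(A, B), matmul(B, A)
    return [[AB[a][b] - BA[a][b] for b in range(3)] for a in range(3)]


def max_violation():
    """Largest entry of [J_i, J_j] - i eps_{ijk} J_k over all i, j."""
    worst = 0.0
    for i in range(3):
        for j in range(3):
            lhs = commutator(J[i], J[j])
            for a in range(3):
                for b in range(3):
                    rhs = sum(1j * eps(i, j, k) * J[k][a][b] for k in range(3))
                    worst = max(worst, abs(lhs[a][b] - rhs))
    return worst
```

Because every entry is an exact small integer multiple of 1 or $i$, the violation comes out as exactly zero rather than merely small.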


Rotation of Spinor Wave Function 


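The spin $\tfrac{1}{2}$ wave function with two components uses the Pauli matrices as

$$S_i = \frac{\hbar}{2}\,\sigma_i \tag{3.138}$$

to describe the angular momentum.
Thus we get

$$\psi_a(x) \to \psi'_a(x) = \left[\exp\left(-\frac{i\theta}{\hbar}\,\mathbf{n}\cdot(\mathbf{L} + \mathbf{S})\right)\right]_{ab} \psi_b(x) \tag{3.139}$$
$$= \left[\exp\left(-\frac{i\theta}{2}\,\mathbf{n}\cdot\boldsymbol{\sigma}\right)\right]_{ab} \psi_b(R^{-1}x) \tag{3.140}$$

and we will just concentrate on the matrix factor.
Notice that

$$\mathbf{n}\cdot\boldsymbol{\sigma}\;\mathbf{n}\cdot\boldsymbol{\sigma} = n_i n_j\, \sigma_i \sigma_j \tag{3.141}$$
$$= n_i n_j \left\{\delta_{ij}\,1 + i\,\epsilon_{ijk}\,\sigma_k\right\} \tag{3.142}$$
$$= 1 \tag{3.143}$$

so that

$$\exp\left(-\frac{i\theta}{2}\,\mathbf{n}\cdot\boldsymbol{\sigma}\right) = \cos\left(\frac{\theta}{2}\,\mathbf{n}\cdot\boldsymbol{\sigma}\right) - i\sin\left(\frac{\theta}{2}\,\mathbf{n}\cdot\boldsymbol{\sigma}\right) \tag{3.144}$$
$$= \cos\left(\frac{\theta}{2}\right) - i\,\mathbf{n}\cdot\boldsymbol{\sigma}\,\sin\left(\frac{\theta}{2}\right) \tag{3.145}$$

$$\therefore\quad \psi_a(x) \to \left[\cos\left(\frac{\theta}{2}\right) - i\,\mathbf{n}\cdot\boldsymbol{\sigma}\,\sin\left(\frac{\theta}{2}\right)\right]_{ab} \psi_b\left(R^{-1}(\theta)\,x\right). \tag{3.146}$$

We first introduced our angle $\theta$ to rotate a vector, and we expected a rotation
of $2\pi$ to take us back to the start. Here we see that

$$\psi(x) \xrightarrow{\;\theta = 2\pi\;} -\psi(x) \tag{3.147}$$

and we seem to have to go round twice in ordinary space to get back to $\psi(x)$
again. No physics goes wrong, because we always have bilinears in $\psi$ to describe physical quantities. This is what distinguishes spinor "representations"
of SO(3) from vectors. (The spinors of SO(3) are vectors of SU(2).)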

To check that we are not missing the point, we should do the finite angle 
transformations of both spinors and vectors in a full and related way. We shall 
need some more powerful machinery. 
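
Before the heavier machinery, the matrix factor can be examined directly. The sketch below compares the closed form $\cos(\theta/2) - i\,\mathbf{n}\cdot\boldsymbol{\sigma}\sin(\theta/2)$ with a term-by-term power series of the exponential and evaluates it at $\theta = 2\pi$ and $4\pi$; the axis `n` is an arbitrary unit vector chosen for illustration.

```python
import math

# Pauli matrices sigma_x, sigma_y, sigma_z.
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
IDENTITY = [[1, 0], [0, 1]]


def n_dot_sigma(n):
    return [[sum(n[k] * SIGMA[k][a][b] for k in range(3)) for b in range(2)]
            for a in range(2)]


def matmul(A, B):
    return [[sum(A[a][c] * B[c][b] for c in range(2)) for b in range(2)]
            for a in range(2)]


def spinor_rotation(n, theta):
    """Closed form: cos(theta/2) - i n.sigma sin(theta/2)."""
    ns = n_dot_sigma(n)
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    return [[c * IDENTITY[a][b] - 1j * s * ns[a][b] for b in range(2)]
            for a in range(2)]


def spinor_rotation_series(n, theta, terms=40):
    """Power series of exp(-i theta/2 n.sigma), summed term by term."""
    ns = n_dot_sigma(n)
    X = [[-0.5j * theta * ns[a][b] for b in range(2)] for a in range(2)]
    total = [[complex(IDENTITY[a][b]) for b in range(2)] for a in range(2)]
    term = [row[:] for row in total]
    for k in range(1, terms):
        term = matmul(term, X)
        term = [[term[a][b] / k for b in range(2)] for a in range(2)]
        total = [[total[a][b] + term[a][b] for b in range(2)] for a in range(2)]
    return total


def max_diff(A, B):
    return max(abs(A[a][b] - B[a][b]) for a in range(2) for b in range(2))


n = [1 / 3, 2 / 3, 2 / 3]      # arbitrary unit vector
MINUS_ONE = [[-1, 0], [0, -1]]
```

Evaluating `spinor_rotation(n, 2 * math.pi)` gives minus the unit matrix to rounding accuracy, while `spinor_rotation(n, 4 * math.pi)` returns to plus the unit matrix, which is the "go round twice" behavior described above.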
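Orbital Angular Momentum ($\mathbf{x} \times \mathbf{p}$)


This is where the story usually starts. One way to represent the angular momentum algebra

$$[L_i, L_j] = i\,\epsilon_{ijk} L_k \tag{3.148}$$

is to consider position and momentum vectors in a [3] space, with an uncertainty relation

$$[x_i, p_j] = i\,\delta_{ij}\,(\hbar) \tag{3.149}$$
$$L_i = \epsilon_{ijk}\, x_j p_k. \tag{3.150}$$

Then

$$[L_i, L_j] = \epsilon_{ipq}\,\epsilon_{jlm}\,[x_p p_q,\; x_l p_m] \tag{3.151}$$
$$= \epsilon_{ipq}\,\epsilon_{jlm}\,\{x_p [p_q, x_l]\, p_m + x_l [x_p, p_m]\, p_q\} \tag{3.152}$$
$$= i\,\epsilon_{ipq}\,\epsilon_{jlm}\,\{-x_p p_m\,\delta_{ql} + x_l p_q\,\delta_{pm}\} \tag{3.153}$$
$$= i\, x_a p_b\,\{-\epsilon_{ias}\,\epsilon_{jsb} + \epsilon_{isb}\,\epsilon_{jas}\} \tag{3.154}$$
$$= i\, x_a p_b\,\{\delta_{ij}\delta_{ab} - \delta_{ib}\delta_{aj} - \delta_{ij}\delta_{ab} + \delta_{ia}\delta_{jb}\} \tag{3.155}$$
$$= i\, x_a p_b\,\epsilon_{ijk}\,\epsilon_{kab} \tag{3.156}$$
$$= i\,\epsilon_{ijk} L_k \tag{3.157}$$

as required. Of course, you cannot represent all cases (odd half-integers) this
way, but we shall see this in detail later.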
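As you know, the Schrödinger representation is the one in which

$$\hat{x} \to x \tag{3.158}$$
$$\hat{p} \to -i(\hbar)\nabla \tag{3.159}$$

but, although this specifies the concrete representation

$$L_i \to -i\hbar\,\epsilon_{ijk}\, x_j \frac{\partial}{\partial x_k} \tag{3.160}$$

it does not reveal the full significance in three dimensions. To do this you
move to spherical polar coordinates.
By thinking about small angles

$$\mathbf{e}_r = \sin\theta\cos\phi\;\mathbf{i} + \sin\theta\sin\phi\;\mathbf{j} + \cos\theta\;\mathbf{k} \tag{3.161}$$
$$\mathbf{e}_\theta = \cos\theta\cos\phi\;\mathbf{i} + \cos\theta\sin\phi\;\mathbf{j} - \sin\theta\;\mathbf{k} \tag{3.162}$$
$$\mathbf{e}_\phi = -\sin\phi\;\mathbf{i} + \cos\phi\;\mathbf{j}. \tag{3.163}$$

These $\mathbf{e}$'s are not fixed. An alternative (instructive) way to obtain these results
is to note that

$$\nabla = \mathbf{e}_r \frac{\partial}{\partial r} + \mathbf{e}_\theta \frac{1}{r}\frac{\partial}{\partial\theta} + \mathbf{e}_\phi \frac{1}{r\sin\theta}\frac{\partial}{\partial\phi} \tag{3.164}$$

(watch the order) and to act on

$$\mathbf{r} = r\,\mathbf{e}_r = \{\sin\theta\,(\mathbf{i}\cos\phi + \mathbf{j}\sin\phi) + \mathbf{k}\cos\theta\}\, r \tag{3.165}$$

with $\mathbf{e}_\theta \cdot \nabla$ and $\mathbf{e}_\phi \cdot \nabla$.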
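Now, if you try to work out $L_i$ in terms of spherical polars it soon gets hard.
The example

$$\frac{\partial}{\partial\phi} = \frac{\partial x}{\partial\phi}\frac{\partial}{\partial x} + \frac{\partial y}{\partial\phi}\frac{\partial}{\partial y} + \frac{\partial z}{\partial\phi}\frac{\partial}{\partial z} \tag{3.166}$$
$$= -r\sin\theta\sin\phi\,\frac{\partial}{\partial x} + r\sin\theta\cos\phi\,\frac{\partial}{\partial y} + 0 \tag{3.167}$$
$$= -y\frac{\partial}{\partial x} + x\frac{\partial}{\partial y} \;\Rightarrow\; L_z = -i\frac{\partial}{\partial\phi} \tag{3.168}$$

is the only easy one. Try it and see. Instead you say:

$$\mathbf{L} = \mathbf{r} \times \mathbf{p} \to -i\,\mathbf{r} \times \nabla \tag{3.169}$$
$$= -i\,r\,\mathbf{e}_r \times \nabla \qquad \{\mathbf{e}_r, \mathbf{e}_\theta, \mathbf{e}_\phi\} \text{ are a right-hand set} \tag{3.170}$$
$$= -i\left(-\mathbf{e}_\theta \frac{1}{\sin\theta}\frac{\partial}{\partial\phi} + \mathbf{e}_\phi \frac{\partial}{\partial\theta}\right) \tag{3.171}$$
$$= \begin{pmatrix} i\left(\cot\theta\cos\phi\,\dfrac{\partial}{\partial\phi} + \sin\phi\,\dfrac{\partial}{\partial\theta}\right) \\[2ex] i\left(\cot\theta\sin\phi\,\dfrac{\partial}{\partial\phi} - \cos\phi\,\dfrac{\partial}{\partial\theta}\right) \\[2ex] i\left(-\dfrac{\partial}{\partial\phi}\right) \end{pmatrix}. \tag{3.172}$$

Hence

$$L_z = -i\frac{\partial}{\partial\phi}, \tag{3.173}$$

as before, while

$$L_\pm = L_x \pm iL_y = \pm e^{\pm i\phi}\left(\frac{\partial}{\partial\theta} \pm i\cot\theta\,\frac{\partial}{\partial\phi}\right). \tag{3.174}$$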
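Again,

$$L^2 = L_x^2 + L_y^2 + L_z^2 = \tfrac{1}{2}\left(L_+L_- + L_-L_+\right) + L_z^2 \tag{3.175}$$
$$= L_z^2 + (\text{Part of } L_+L_- \text{ even in } \theta \to -\theta,\; \phi \to -\phi) \tag{3.176}$$
$$= -\frac{\partial^2}{\partial\phi^2} + \text{"Even part" of } \left[-e^{i\phi}\left(\frac{\partial}{\partial\theta} + i\cot\theta\frac{\partial}{\partial\phi}\right) e^{-i\phi}\left(\frac{\partial}{\partial\theta} - i\cot\theta\frac{\partial}{\partial\phi}\right)\right] \tag{3.177}$$
$$= -\frac{\partial^2}{\partial\phi^2} + \text{"Even part" of } \left[-\frac{\partial}{\partial\theta}\left(\frac{\partial}{\partial\theta} - i\cot\theta\frac{\partial}{\partial\phi}\right) - i\cot\theta\left(\frac{\partial}{\partial\phi} - i\right)\left(\frac{\partial}{\partial\theta} - i\cot\theta\frac{\partial}{\partial\phi}\right)\right] \tag{3.178}$$
$$= -\frac{\partial^2}{\partial\phi^2} - \left\{\frac{\partial^2}{\partial\theta^2} + \cot\theta\frac{\partial}{\partial\theta} + \cot^2\theta\,\frac{\partial^2}{\partial\phi^2}\right\} \tag{3.179}$$
$$= -\left\{\frac{\partial^2}{\partial\theta^2} + \cot\theta\frac{\partial}{\partial\theta} + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right\} \tag{3.180}$$
$$= -\left\{\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right\}. \tag{3.181}$$

Note: The $\{\mathbf{e}_r, \mathbf{e}_\theta, \mathbf{e}_\phi\}$ are not fixed, so to get this result by $(-i\mathbf{r} \times \nabla)\cdot(-i\mathbf{r} \times \nabla)$
in spherical polars needs $\nabla \cdot \mathbf{e}_r$ and so forth. Try if you like; it is not easy.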

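Now, what we have shown is that $L_i$ does not depend upon $r$ at all. The
action of the $L_i$ on the space is to induce rotations. We shall return to this later.
Meantime, we ought to specify the representations. You should solve

$$L_z Y_l^m(\theta, \phi) = m\, Y_l^m(\theta, \phi) \tag{3.182}$$

$$L^2\, Y_l^m(\theta, \phi) = l(l + 1)\, Y_l^m(\theta, \phi), \tag{3.183}$$

but $L^2$ is horrible and has two derivatives, so you solve instead

$$L_z Y_l^m(\theta, \phi) = m\, Y_l^m(\theta, \phi) \tag{3.184}$$

$$-i\frac{\partial}{\partial\phi} Y_l^m = m\, Y_l^m \tag{3.185}$$

$$\frac{\partial Y_l^m}{\partial\phi} = i m\, Y_l^m \tag{3.186}$$

$$Y_l^m = c_l^m\, e^{im\phi}\, P_l^m(\theta). \tag{3.187}$$

$$L_+ Y_l^l = 0 \tag{3.188}$$

$$\Rightarrow\; e^{i\phi}\left(\frac{\partial}{\partial\theta} + i\cot\theta\frac{\partial}{\partial\phi}\right) e^{il\phi}\, P_l^l(\theta) = 0 \tag{3.189}$$

$$\left(\frac{d}{d\theta} - l\cot\theta\right) P_l^l(\theta) = 0 \tag{3.190}$$

$$\Rightarrow\; \frac{dP_l^l}{P_l^l} = l\cot\theta\, d\theta = l\,\frac{d(\sin\theta)}{\sin\theta} \tag{3.191}$$

$$P_l^l = (\text{const.})\,\sin^l\theta. \tag{3.192}$$

$$L_\pm Y_l^m(\theta, \phi) = \sqrt{(l \mp m)(l \pm m + 1)}\; Y_l^{m \pm 1}(\theta, \phi). \tag{3.193}$$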


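We know (by the orthogonality theorem) that these functions can be made
orthonormal, and we normalize on the unit sphere

$$\int_0^\pi d\theta\,\sin\theta \int_0^{2\pi} d\phi\; Y_l^{m*}(\theta, \phi)\, Y_{l'}^{m'}(\theta, \phi) = \delta_{ll'}\,\delta_{mm'}. \tag{3.194}$$

Here we need

$$1 = |c_l^l|^2 \int_0^{2\pi} d\phi \int_0^\pi d\theta\,\sin\theta\,\sin^{2l}\theta = |c_l^l|^2\, 2\pi \int_0^\pi d\theta\,\sin^{2l+1}\theta \tag{3.195}$$

$$= |c_l^l|^2\, 2\pi\, I_l. \tag{3.196}$$

But

$$I_l = -\int_0^\pi \sin^{2l}\theta\; d(\cos\theta) \tag{3.197}$$
$$= -\left[\cos\theta\,\sin^{2l}\theta\right]_0^\pi + \int_0^\pi \cos\theta\; d(\sin^{2l}\theta) \tag{3.198}$$
$$= \int_0^\pi 2l\,\cos^2\theta\,\sin^{2l-1}\theta\; d\theta \tag{3.199}$$
$$= 2l\, I_{l-1} - 2l\, I_l \tag{3.200}$$
$$\Rightarrow\; I_l = \frac{2l}{2l+1}\, I_{l-1} = \frac{2l}{2l+1}\cdot\frac{2l-2}{2l-1}\cdots\frac{2}{3}\, I_0 \tag{3.201}$$
$$= \frac{2\,(2^l\, l!)^2}{(2l+1)!}. \tag{3.202}$$

Convention then takes $c_l^l$ real, and the sign $(-1)^l$ to make $Y_l^0(0, 0)$ real and
positive, and we find

$$Y_l^l = \frac{(-1)^l}{2^l\, l!}\sqrt{\frac{(2l+1)!}{4\pi}}\; e^{il\phi}\,\sin^l\theta. \tag{3.203}$$

The rest are now found by lowering:

$$Y_l^{m-1} = \frac{1}{\sqrt{(l+m)(l-m+1)}}\; L_-\, Y_l^m \tag{3.204}$$

$$Y_l^m = \sqrt{\frac{(l+m)!}{(2l)!\,(l-m)!}}\; (L_-)^{l-m}\, Y_l^l. \tag{3.205}$$

The conventional connection between the spherical harmonics ($Y_l^m$) and the
associated (first kind) Legendre functions ($P_l^m$) is:

$$Y_l^m = (-1)^m \sqrt{\frac{(2l+1)\,(l-m)!}{4\pi\,(l+m)!}}\; e^{im\phi}\, P_l^m(\cos\theta) \quad (m \geq 0), \tag{3.206}$$

and in particular,

$$Y_l^0 = \sqrt{\frac{2l+1}{4\pi}}\; P_l(\cos\theta) \tag{3.207}$$

where the $P_l$ are the Legendre polynomials (the 0 index is conventionally
dropped).

Now

$$Y_0^0 = \frac{1}{\sqrt{4\pi}} \tag{3.208}$$
$$Y_1^1 = -\sqrt{\frac{3}{8\pi}}\; e^{i\phi}\,\sin\theta \tag{3.209}$$
$$Y_1^0 = \sqrt{\frac{3}{4\pi}}\;\cos\theta \tag{3.210}$$
$$Y_1^{-1} = \sqrt{\frac{3}{8\pi}}\; e^{-i\phi}\,\sin\theta. \tag{3.211}$$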
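
The normalization on the unit sphere can be checked numerically for the $l = 1$ harmonics. This is a minimal sketch using a midpoint rule; the grid sizes `n_theta` and `n_phi` are arbitrary illustrative choices, and the function names are not from the text.

```python
import cmath
import math


def Y1(m, theta, phi):
    """The l = 1 spherical harmonics of Equations (3.209) to (3.211)."""
    if m == 1:
        return -math.sqrt(3 / (8 * math.pi)) * cmath.exp(1j * phi) * math.sin(theta)
    if m == 0:
        return complex(math.sqrt(3 / (4 * math.pi)) * math.cos(theta))
    if m == -1:
        return math.sqrt(3 / (8 * math.pi)) * cmath.exp(-1j * phi) * math.sin(theta)
    raise ValueError("m must be -1, 0, or 1 for l = 1")


def overlap(m1, m2, n_theta=200, n_phi=64):
    """Midpoint-rule estimate of the integral in Equation (3.194) for l = l' = 1."""
    d_theta = math.pi / n_theta
    d_phi = 2 * math.pi / n_phi
    total = 0j
    for a in range(n_theta):
        theta = (a + 0.5) * d_theta
        for b in range(n_phi):
            phi = (b + 0.5) * d_phi
            total += (Y1(m1, theta, phi).conjugate()
                      * Y1(m2, theta, phi) * math.sin(theta))
    return total * d_theta * d_phi
```

Calling `overlap(m, m)` for each of the three values of `m` should give a number close to 1, while mixed pairs such as `overlap(1, 0)` should come out close to 0.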
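The Spinors Revisited
Recall that

$$U = \exp\left(-\frac{i\theta}{2}\,\mathbf{n}\cdot\boldsymbol{\sigma}\right) \tag{3.212}$$
$$\psi_a \to U_{ab}\,\psi_b \tag{3.213}$$
$$V_i \to R_{ij} V_j \tag{3.214}$$

are the transformation laws under an angle of magnitude $\theta$ about the axis
parallel to $\mathbf{n}$.
The projection operators

$$P^\pm = \tfrac{1}{2}\left(1 \pm \mathbf{n}\cdot\boldsymbol{\sigma}\right) \tag{3.215}$$

have the usual properties in the spinor sector, so that

$$U = 1\cos\left(\frac{\theta}{2}\right) - i\,\boldsymbol{\sigma}\cdot\mathbf{n}\,\sin\left(\frac{\theta}{2}\right) \tag{3.216}$$

as previously.
What are the related projection operators for the vector representation?
Consider

$$(P^{AB})_{ij} = \tfrac{1}{2}\,\mathrm{Tr}\left(P^A \sigma_i P^B \sigma_j\right), \quad A \neq B \tag{3.217}$$

$$(\mathcal{I})_{ij} = \tfrac{1}{2}\sum_A \mathrm{Tr}\left(P^A \sigma_i P^A \sigma_j\right) \tag{3.218}$$

where $A$ takes the range 1 to 2, and $i$ takes the range 1 to 3.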
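Use the completeness

$$(\sigma_i)_{\alpha\beta}(\sigma_i)_{\gamma\delta} + \delta_{\alpha\beta}\,\delta_{\gamma\delta} = 2\,\delta_{\alpha\delta}\,\delta_{\gamma\beta} \tag{3.219}$$

to write

$$\tfrac{1}{2}\mathrm{Tr}(\sigma_i X)\;\tfrac{1}{2}\mathrm{Tr}(\sigma_i Y) = \tfrac{1}{2}\mathrm{Tr}(XY) - \tfrac{1}{4}\mathrm{Tr}(X)\,\mathrm{Tr}(Y) \tag{3.220}$$

$$\tfrac{1}{2}\mathrm{Tr}(\sigma_i X \sigma_i Y) = \mathrm{Tr}(X)\,\mathrm{Tr}(Y) - \tfrac{1}{2}\mathrm{Tr}(XY). \tag{3.221}$$

Then

$$(P^{AB} P^{CD})_{kj} = \tfrac{1}{2}\mathrm{Tr}\left(P^A \sigma_k P^B \sigma_i\right)\;\tfrac{1}{2}\mathrm{Tr}\left(P^C \sigma_i P^D \sigma_j\right) \tag{3.222}$$
$$= \delta^{AC}\,\delta^{BD}\;\tfrac{1}{2}\mathrm{Tr}\left(P^A \sigma_k P^B \sigma_j\right) \quad (\text{no sum}), \text{ by Equation 3.220} \tag{3.223}$$
$$= 0 \quad\text{if } A \neq C \text{ and/or } B \neq D. \tag{3.224}$$

Again

$$(P^{AB}\,\mathcal{I})_{jk} = \tfrac{1}{2}\mathrm{Tr}\left(P^A \sigma_j P^B \sigma_i\right)\;\tfrac{1}{2}\sum_C \mathrm{Tr}\left(P^C \sigma_i P^C \sigma_k\right) \tag{3.225}$$
$$= 0 \quad\text{by Equation (3.220), because } A \neq B, \tag{3.226}$$

and similarly $\mathcal{I}\,P^{AB} = 0$.
Yet again

$$(\mathcal{I}\,\mathcal{I})_{kj} = \tfrac{1}{2}\sum_A \mathrm{Tr}\left(P^A \sigma_k P^A \sigma_i\right)\;\tfrac{1}{2}\sum_B \mathrm{Tr}\left(P^B \sigma_i P^B \sigma_j\right) \tag{3.227}$$
$$= \tfrac{1}{2}\sum_A \mathrm{Tr}\left(P^A \sigma_k P^A \sigma_j\right) = \mathcal{I}_{kj} \quad\text{by Equation 3.220}, \tag{3.228}$$

because $\sum_A P^A = 1$ and the $\sigma_i$ are traceless.
Finally

$$\sum_{A \neq B} (P^{AB})_{ij} + \mathcal{I}_{ij} = \delta_{ij}, \tag{3.229}$$

so these are projection operators.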
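Dimensions of Projected Spaces


These are given by traces. $\mathrm{Tr}(P^{AB})$ = by Equation (3.221) = 1, so this is one
dimensional and there are $2(2 - 1) = 2$ of these projectors.

$$\mathrm{Tr}(\mathcal{I}) = \tfrac{1}{2}\sum_A \mathrm{Tr}\left(P^A \sigma_i P^A \sigma_i\right) \tag{3.230}$$
$$= \sum_A \left[\mathrm{Tr}(P^A)\,\mathrm{Tr}(P^A) - \tfrac{1}{2}\mathrm{Tr}(P^A P^A)\right] \tag{3.231}$$

by Equation 3.221,

$$= \sum_{A=1}^{2} (1 \times 1) - \tfrac{1}{2}\sum_{A=1}^{2} (1) = 2 - 1 = 1 \tag{3.232}$$

as expected, but this space is not irreducible.
In our SU(2) notation we find

$$(P^{\pm\mp})_{ij} = \tfrac{1}{2}\mathrm{Tr}\left(P^\pm \sigma_i P^\mp \sigma_j\right) \tag{3.233}$$
$$= \tfrac{1}{2}\left[\delta_{ij} - n_i n_j \mp i\,\epsilon_{ikj}\, n_k\right] \tag{3.234}$$

$$\text{and}\quad (\mathcal{I})_{ij} = \tfrac{1}{2}\mathrm{Tr}\left[(P^+ \sigma_i P^+ \sigma_j) + (P^- \sigma_i P^- \sigma_j)\right] \tag{3.235}$$
$$= \tfrac{1}{2}\mathrm{Tr}(\sigma_i \sigma_j) - (P^{+-} + P^{-+})_{ij} \tag{3.236}$$
$$= \delta_{ij} - (\delta_{ij} - n_i n_j) = n_i n_j. \tag{3.237}$$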


Connection between the "Mixed Spinor" and the Adjoint (Regular)
Representation


Recall that the product of two spin $\tfrac{1}{2}$ states gave a three-dimensional vector
and a singlet. When the singlet has been removed by the trace, we can write

$$V_{\alpha\beta} = \frac{1}{\sqrt{2}}\, V_i\, (\sigma_i)_{\alpha\beta} \tag{3.238}$$

and the inverse

$$V_i = \frac{1}{\sqrt{2}}\,\mathrm{Tr}(V \sigma_i). \tag{3.239}$$
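Now when the spinor transforms as

$$\psi_\alpha \to U_{\alpha\beta}\,\psi_\beta \tag{3.240}$$

then we see that

$$V_i \to \frac{1}{\sqrt{2}}\,\mathrm{Tr}\left(U V U^{-1} \sigma_i\right) \tag{3.241}$$

is the transformation of the vector. Expanding for small angles we see that

$$U = 1 - \tfrac{1}{2}\, i\,\boldsymbol{\theta}\cdot\boldsymbol{\sigma} \tag{3.242}$$

$$U^{-1} = 1 + \tfrac{1}{2}\, i\,\boldsymbol{\theta}\cdot\boldsymbol{\sigma}, \tag{3.243}$$

$$V'_i = \frac{1}{\sqrt{2}}\,\mathrm{Tr}\left(V U^{-1} \sigma_i U\right) \tag{3.244}$$
$$= V_i + \theta_k\,\epsilon_{ikj}\, V_j \tag{3.245}$$

to first order, which we hope is familiar. We would like the full version of all
this. Returning to

$$V_i \to \frac{1}{\sqrt{2}}\,\mathrm{Tr}\left(U V U^{-1} \sigma_i\right) \tag{3.246}$$

we use Equation (3.220) from the completeness relations to establish that

$$V_i \to R_{ij} V_j \tag{3.247}$$
$$R_{ij} = \tfrac{1}{2}\,\mathrm{Tr}\left(U^{-1} \sigma_i U \sigma_j\right). \tag{3.248}$$

It is straightforward to use Equation (3.220) again to show that

$$R_{ij}\,(R^T)_{jk} = \delta_{ik}, \tag{3.249}$$

where the $T$ indicates transposition of the matrix, so that $R$ is, as expected,
actually orthogonal.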


Finite Angle Rotation of SO(3) Vector
We have just established that

$$U = \exp\left(-\frac{i\theta}{2}\right) P^+ + \exp\left(+\frac{i\theta}{2}\right) P^- \tag{3.250}$$

and that

$$R_{ij} = \mathcal{I}_{ij} + (P^{+-})_{ij}\,\exp(i\theta) + (P^{-+})_{ij}\,\exp(-i\theta) \tag{3.251}$$
$$= n_i n_j + (\delta_{ij} - n_i n_j)\cos\theta - i\,\epsilon_{ikj}\, n_k\; i\sin\theta \tag{3.252}$$
$$R(\mathbf{n}, \theta)_{ij} = n_i n_j\,[1 - \cos(\theta)] + \delta_{ij}\cos(\theta) + \epsilon_{ikj}\, n_k \sin\theta. \tag{3.253}$$

This is often written in the form of dot and cross products as

$$\mathbf{V} \to \mathbf{V}' = \mathbf{V}\cos(\theta) + [1 - \cos(\theta)]\,(\mathbf{V}\cdot\mathbf{n})\,\mathbf{n} + \sin(\theta)\,\mathbf{n} \times \mathbf{V}. \tag{3.254}$$

(If you are not familiar with this result, it could be checked by using the [cubic]
characteristic equation method.) Note that when $\theta \to 2\pi$ then $\mathbf{V}' \to \mathbf{V}$. So
the rotations really do what we expect and the spinors of SO(3) really do a
strange double cover transformation.
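
The link between the spinor matrix and the vector rotation can be tested directly. This sketch builds $R_{ij} = \tfrac{1}{2}\mathrm{Tr}(U^{-1}\sigma_i U \sigma_j)$ from $U = \cos(\theta/2) - i\,\mathbf{n}\cdot\boldsymbol{\sigma}\sin(\theta/2)$ and compares it with the closed form of Equation (3.253); the axis `n` is an arbitrary unit vector and indices run over 0, 1, 2.

```python
import math

SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]


def eps(i, j, k):
    """epsilon_{ijk} for indices drawn from {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2


def matmul(A, B):
    return [[sum(A[a][c] * B[c][b] for c in range(2)) for b in range(2)]
            for a in range(2)]


def spinor_U(n, theta):
    """U = cos(theta/2) - i n.sigma sin(theta/2); U(n, -theta) is its inverse."""
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    return [[c * (1 if a == b else 0)
             - 1j * s * sum(n[k] * SIGMA[k][a][b] for k in range(3))
             for b in range(2)] for a in range(2)]


def rotation_from_spinor(n, theta):
    """R_ij = (1/2) Tr(U^{-1} sigma_i U sigma_j), Equation (3.248)."""
    U, U_inv = spinor_U(n, theta), spinor_U(n, -theta)
    R = []
    for i in range(3):
        row = []
        for j in range(3):
            M = matmul(matmul(matmul(U_inv, SIGMA[i]), U), SIGMA[j])
            row.append(0.5 * (M[0][0] + M[1][1]).real)
        R.append(row)
    return R


def rotation_closed_form(n, theta):
    """R_ij = n_i n_j (1 - cos) + delta_ij cos + eps_{ikj} n_k sin, Equation (3.253)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[n[i] * n[j] * (1 - c) + (c if i == j else 0.0)
             + s * sum(eps(i, k, j) * n[k] for k in range(3))
             for j in range(3)] for i in range(3)]


def max_diff(A, B):
    return max(abs(A[i][j] - B[i][j]) for i in range(3) for j in range(3))


n = [1 / 3, 2 / 3, 2 / 3]      # arbitrary unit vector
```

At $\theta = 2\pi$ the matrix `rotation_from_spinor(n, 2 * math.pi)` is the unit matrix even though `spinor_U(n, 2 * math.pi)` is minus the unit matrix, which is the double cover seen from both sides.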
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Problems 


3.1 Read through from Equations (3.4) to (3.8). Now close the book and 
try to rewrite them. 


3.2 Explain what is understood by a contravariant vector. 
3.3 Explain what is understood by a covariant vector. 


3.4 Show how to express a tensor T into symmetric and antisymmetric 
parts. Using the Levi-Civita tensor show how to write the antisym- 
metric part as a covariant vector. 


3.5 Takingacontravariant vector as the example, show how transitivity 
works. 


3.6 Using your own notation, show that for rotations in three dimen- 
sions, covariant and contravariant are equivalent. 


3.7 Explainin your own words what is understood by a scalar operator 
and a vector operator. 
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3.8 Show in terms of Dirac's state vectors that operators representing 
transformations on the space formed by them are unitary. 


3.9 Show how to express the operator R(e,, 0) as a 3 x 3 matrix. 


3.10 Using your own notation show how to build up a rotation operator 
in three dimensions by an angle about a fixed vector axis. 


3.11 Take the rotation operator you constructed in the previous problem 
and by rotating a vector operator K; and then expanding to first 
nontrivial order in the angle, find the commutator of K; with the 
components of the angular momentum. Hence, retrieve the familiar 
commutation relations of angular momentum. 


3.12 Using your own notation show how to describe the angular mo- 
mentum of a spin i wave function. Use your result to find the2 x 2 
transformation on a spin i wave function. 


3.13 Using your own notation, show how to find the commutation rela- 
tions of components of orbital angular momentum for the integer 
cases. 


3.14 Using spherical polar coordinates, work out the operator L,. 
3.15 Repeat Problem 3.14 starting with L — r x p. 


3.16 Using your own notation work out the correctly normalized form 
of the spherical harmonics Y], Y?, and Y, !. 


3.17 Show that the familiar operators P^ = 1(1- m. c) are indeed 


= 2 
projection operators. 


3.18 Show what is understood by completeness of the Pauli matrices 
and the associated unit operator matrix. 


3.19 What dimension do the state vectors projected by the operators in 
the previous two problems have? 


3.20 The product of two spin-1/2 states gives a three-dimensional vector 
and a singlet. When the singlet has been removed by tracing, 
show how the components of the three-dimensional vector can be 
expressed in terms of the trace of the matrix and Pauli matrices and 
also vice versa. 


4 


Special Relativity and the Physical 
Particle States 


The Dirac Equation 


We look for a Hamiltonian to use in the time-dependent Schrödinger equation, 
and because this is first order in time we stay first order in space, trying to 
treat all coordinates on a similar basis. Write 

    $i\frac{\partial\psi}{\partial t} = H\psi = \left(-i\alpha^j\frac{\partial}{\partial x^j} + \beta m\right)\psi$    (4.1)

where $\psi$ is now a column and the $\alpha^j$ and $\beta$ are square matrices. 
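We require $p^2 = m^2$, that is, the Klein-Gordon equation $(\Box + m^2)\psi = 0$, but we 
actually have 

    $-\frac{\partial^2\psi}{\partial t^2} = -\frac{1}{2}(\alpha^i\alpha^j + \alpha^j\alpha^i)\frac{\partial^2\psi}{\partial x^i\partial x^j} - im(\alpha^j\beta + \beta\alpha^j)\frac{\partial\psi}{\partial x^j} + m^2\beta^2\psi$    (4.2)

by iteration (or squaring) and using the symmetry of $\partial^2/\partial x^i\partial x^j$ in the second-derivative term. Hence we 
require the algebra 

    $\alpha^i\alpha^j + \alpha^j\alpha^i = 2\delta^{ij}$
    $\alpha^i\beta + \beta\alpha^i = 0$
    $(\alpha^i)^2 = 1 = \beta^2$.    (4.3)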


Moreover if $H$ is Hermitian, then so are the $\alpha^j$ and $\beta$. 

Now any Hermitian matrix whose square is 1 has eigenvalues only $\pm 1$. (Proof: $M\psi = a\psi$ 
with real $a$ because $M$ is Hermitian. Thus $\psi = M^2\psi = Ma\psi = a^2\psi$.) Also 
$\beta$ and the $\alpha^j$ are traceless. (Proof: $\mathrm{Tr}(\alpha^j) = \mathrm{Tr}(\beta^2\alpha^j) = \mathrm{Tr}(\beta\alpha^j\beta) = \mathrm{Tr}(-\alpha^j)$.) 

Hence, since the trace of a matrix is the sum of the eigenvalues, there must 
be an equal number of $+1$'s and $-1$'s; these matrices are thus of even dimension. 
Clearly 2 x 2 is too small: there are only three Pauli matrices, but we need at least four Dirac 
matrices. We shall try 4 x 4. 
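
The algebra (4.3) can be checked numerically. The following is a minimal sketch in plain Python, assuming the standard choice $\beta = \tau_3 \times 1$, $\alpha^i = \tau_1 \times \sigma^i$ that appears later in this chapter; the helper names (`matmul`, `kron`, `satisfies_dirac_algebra`, and so on) are illustrative and not taken from the text.

```python
def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def kron(a, b):
    """Kronecker (direct) product a x b."""
    return [[x * y for x in row_a for y in row_b]
            for row_a in a for row_b in b]

def anticommutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[x + y for x, y in zip(r1, r2)] for r1, r2 in zip(ab, ba)]

def scaled_identity(c, n):
    return [[c if i == j else 0 for j in range(n)] for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

I2 = [[1, 0], [0, 1]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]

# Assumed 4 x 4 representation: beta = tau_3 x 1, alpha^i = tau_1 x sigma^i.
BETA = kron(SIGMA[2], I2)
ALPHA = [kron(SIGMA[0], s) for s in SIGMA]

def satisfies_dirac_algebra(alpha, beta):
    """True if alpha, beta obey all three relations of (4.3)."""
    n = len(beta)
    for i, a_i in enumerate(alpha):
        for j, a_j in enumerate(alpha):
            expected = scaled_identity(2 if i == j else 0, n)
            if anticommutator(a_i, a_j) != expected:
                return False
        if anticommutator(a_i, beta) != scaled_identity(0, n):
            return False
    return matmul(beta, beta) == scaled_identity(1, n)

print(satisfies_dirac_algebra(ALPHA, BETA))
print([trace(m) for m in ALPHA + [BETA]])
# 2 x 2 attempt: no Pauli matrix can serve as beta for the other three.
print(any(satisfies_dirac_algebra(SIGMA, b) for b in SIGMA))
```

The last line illustrates the remark that 2 x 2 is too small: whichever Pauli matrix is tried as $\beta$, it fails to anticommute with itself in the role of one of the $\alpha^i$.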

All possible products will now therefore yield (up to) 16 independent matrices 
that are Hermitian. But for reasons that will become clear (covariance 
with the pseudo-orthogonal Lorentz group), we choose to work with 16 that have 
a pseudo-Hermitian property. Define 


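    $\gamma^0 = \beta$ and $\gamma^i = \beta\alpha^i$    (4.4)

so that the Dirac equation now reads 

    $i\gamma^0\frac{\partial\psi}{\partial t} + i\gamma^j\frac{\partial\psi}{\partial x^j} = m\psi$
    or $i\gamma^\mu\partial_\mu\psi = m\psi$
    or $i\slashed{\partial}\psi = m\psi$    (4.5)

where we have written $\gamma^\mu p_\mu = \slashed{p} = p^\mu\gamma_\mu$ for $p^\mu$ any vector. 
Notes: (a) If $p^\mu = i\partial^\mu$, then $(\slashed{p} - m)\psi = 0$. (b) Despite the index notation the 
$\gamma^\mu$ are just four matrices (similarly $\gamma_\mu = g_{\mu\nu}\gamma^\nu$) and they never transform. 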


The Clifford Algebra: Properties of $\gamma$ Matrices 
The Clifford algebra now reads 

    $\{\gamma^\mu, \gamma^\nu\} = 2g^{\mu\nu}$    (4.6)

where 1 has been suppressed on the right-hand side. All the $\gamma$'s are traceless. 
$\gamma^0$ is Hermitian and the $\gamma^i$ are anti-Hermitian. We have four of the $\gamma$'s and the 
unit matrix, so there are 11 more objects to find. First look at the commutators 
of the $\gamma$'s: 

    $\sigma^{\mu\nu} = \frac{i}{2}[\gamma^\mu, \gamma^\nu]$    (4.7)

and this gives us six more because of the antisymmetry in $\mu \leftrightarrow \nu$. 

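Clearly the $\sigma^{ij}$ are Hermitian and the $\sigma^{0i}$ anti-Hermitian. All $\sigma^{\mu\nu}$ are traceless 
because any product of anticommuting matrices is traceless. (Proof: 
$\mathrm{Tr}(AB) = (-)\mathrm{Tr}(BA) = (-)\mathrm{Tr}(AB)$, where the first step uses the anticommutation 
and the second the cyclic property of the trace. This assumes finiteness.) 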

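Now because each of the $\gamma^\mu$ squares to $\pm 1$, we can see that the last five 
matrices must be four of the type $\gamma^0\gamma^1\gamma^2$ (i.e., missing one of the $\gamma^\mu$ each time) 
and with $\gamma^0\gamma^1\gamma^2\gamma^3$ as the final one. We define 

    $\sigma^{\mu 5} = i\gamma^\mu\gamma^5 = \frac{i}{3!}\epsilon^{\mu\nu\rho\lambda}\gamma_\nu\gamma_\rho\gamma_\lambda$    (4.8)

where, using $\gamma_\mu\gamma^\mu = 4$, 

    $\gamma^5 = \frac{1}{4!}\epsilon^{\mu\nu\rho\lambda}\gamma_\mu\gamma_\nu\gamma_\rho\gamma_\lambda$.    (4.9)

Notice that $\epsilon^{0123} = (-)1$ so that $\gamma^5 = (-)\gamma_0\gamma_1\gamma_2\gamma_3 = \gamma^0\gamma^1\gamma^2\gamma^3 = \gamma^0\gamma^3\gamma^1\gamma^2$ 
and so forth. Clearly, by looking at the $3\gamma$ and $4\gamma$ expressions, the $\sigma^{i5}$ are Hermitian, 
but $\gamma^5$ and $\sigma^{05}$ are anti-Hermitian. Again, $\{\gamma^\mu, \gamma^5\} = \{\gamma^\mu, \gamma^0\gamma^1\gamma^2\gamma^3\}$, 
and $\gamma^\mu$ anticommutes with three of these factors but commutes with itself, so 
that 

    $\{\gamma^\mu, \gamma^5\} = 0$.    (4.10)

Thus 

    $\sigma^{\mu 5} = \frac{i}{2}[\gamma^\mu, \gamma^5]$    (4.11)

which fits with the previous notation and also shows that $\mathrm{Tr}(\sigma^{\mu 5}) = 0$ because 
it is the product of anticommuting matrices. 

Finally $\gamma^5 = \gamma^0(\gamma^1\gamma^2\gamma^3) = -(\gamma^1\gamma^2\gamma^3)\gamma^0$ so that $\gamma^5$ is traceless too. 
Moreover 

    $\gamma^5\gamma^5 = \gamma^0\gamma^1\gamma^2\gamma^3\gamma^0\gamma^1\gamma^2\gamma^3 = -1$
    $\{\gamma^5, \gamma^5\} = -2$.    (4.12)

You now have 16 matrices, all Hermitian or anti-Hermitian, and all traceless 
except for one. 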

To establish that they are linearly independent you work out the full mul- 
tiplication table (Figure 4.1). 


[Table of products of pairs of the 16 matrices $1$, $\gamma^\mu$, $\sigma^{\mu\nu}$, $\sigma^{\mu 5}$, and $\gamma^5$; the entries are not legible in this copy.] 


FIGURE 4.1 
The $\gamma$ matrix product table. 
Structure of the Clifford Algebra and Representation 


If you examine the table (Figure 4.1) you see that the square of each of the 
16 $\Gamma^R$ is $\pm 1$. Indeed, you can see that $(\Gamma^R)^{-1} = (\Gamma^R)^\dagger$ for all $R$. Moreover the 
product of any two closes: 

    $\Gamma^R\Gamma^S = \rho^{RS}{}_T\,\Gamma^T$    ($\rho$ are complex numbers; each is $\pm 1$ or $\pm i$)    (4.13)

in such a way that there is only one term in the sum, and only the unit matrix appears 
when you have a square. 


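1. The 16 $\Gamma^R$ give a linearly independent basis. 

   Proof: If $\sum_R c_R\Gamma^R = 0$ then multiply by $\Gamma^S$ and trace: $0 = \sum_R c_R\,\mathrm{Tr}(\Gamma^S\Gamma^R) = 4\epsilon_S c_S$, 
   so $c_S = 0$, where we have defined $(\Gamma^S)^2 = \epsilon_S 1$. We see that 

       $(\Gamma^S)^{-1} = (\Gamma^S)^\dagger = \epsilon_S\Gamma^S$, $\epsilon_S = \pm 1$    (4.14)

   is another way of writing this. 

2. $\epsilon_R\,\rho^{RS}{}_Q = \epsilon_Q\,\rho^{SQ}{}_R$ (no sums). 

   Proof: $\Gamma^R\Gamma^S = \rho^{RS}{}_Q\Gamma^Q$. (There was only one term in the $Q$ "sum.") 
   Multiplying on the right by $\Gamma^Q$ gives $\Gamma^R\Gamma^S\Gamma^Q = \epsilon_Q\,\rho^{RS}{}_Q$, and then multiplying 
   on the left by $\epsilon_R\Gamma^R$ gives $\Gamma^S\Gamma^Q = \epsilon_R\epsilon_Q\,\rho^{RS}{}_Q\,\Gamma^R$. Comparing with 
   $\Gamma^S\Gamma^Q = \rho^{SQ}{}_R\Gamma^R$ we have 

       $\epsilon_R\,\rho^{RS}{}_Q = \epsilon_Q\,\rho^{SQ}{}_R$.    (4.15)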


3. If there are two representations of the algebra of multiplication, $\Gamma^R$, 
   which is $N \times N$, and $\gamma^R$, which is $n \times n$, then there is a matrix linking 
   them. 

   Proof: Let $F$ be an arbitrary $N \times n$ matrix and define $S = \epsilon_R\Gamma^R F\gamma^R$, 
   where the triple index is intended to be summed and there are 16 terms in the sum. 
   Then 

       $S\gamma^S = \epsilon_R\,\Gamma^R F\gamma^R\gamma^S$
       $= \epsilon_Q\,\rho^{QS}{}_R\,\Gamma^Q F\gamma^R$    (relabel)
       $= \epsilon_R\,\rho^{SR}{}_Q\,\Gamma^Q F\gamma^R$    (by 2 above)
       $= \Gamma^S\epsilon_R\,\Gamma^R F\gamma^R$    (same algebra for $\gamma$ and $\Gamma$)
       $= \Gamma^S S$.    (4.16)

   Notice that $F$ is quite arbitrary and since the $\gamma$'s are linearly independent 
   we can always find an $S$ which is not identically zero (e.g., 
   take just one entry of $F \neq 0$). 


4. Up to equivalence there is only one irreducible representation of the 
   $\gamma$ matrices. Up to a constant factor the matrix giving the equivalence 
   is unique. 

   Proof: We have to invoke Schur's lemma; we will prove this on the 
   way. The first part of Schur's lemma states that if the representations 
   are irreducible (i.e., there are no nontrivial invariant subspaces; the null 
   space and the whole space are trivial) then $S$ is either null, or is nonsingular 
   with $N = n$. 

   To prove this note that $\Gamma$ and $\gamma$ induce linear transformations in the 
   vector spaces $P_N$ and $P_n$. But $S$ maps $P_n$ into a subspace $P$ of $P_N$ (we assume 
   $N \geq n$ without loss of generality) by 

       $S: P_n \to P = S\{P_n\} \subseteq P_N$    ($S$ is $N \times n$)

   and this $P$ is an invariant subspace of $P_N$ because 

       $\Gamma^R\{P\} = \Gamma^R S\{P_n\} = S\gamma^R\{P_n\} \subseteq S\{P_n\} = P$

   so the irreducibility implies $P$ is either empty or identical to $P_N$. But 
   we know that $S$ is not null, so $P_N = S\{P_n\}$, which has dimension at most $n$, while 
   $n \leq N$; clearly $n = N$. Moreover $S$ has an inverse, because if it had 
   zero determinant it would pick out a subspace. More precisely, if $S$ 
   is singular then at least one eigenvalue is zero, so let $\{P^0\}$ be the set 
   with $S\{P^0\} = 0$. Then 

       $S\gamma^R\{P^0\} = \Gamma^R S\{P^0\} = \Gamma^R\,0 = 0$

   and $\{P^0\}$ is an invariant subspace. 

   But $S$ is not null (which could be the case if all the eigenvalues were 
   zero), hence we have a contradiction, so $S$ is not singular. 

   The two representations are equivalent by 

       $\Gamma^R = S\gamma^R S^{-1}$    (4.17)

   and we can now show that $S$ is unique up to a multiplicative factor 
   by using the second part of Schur's lemma: If a matrix commutes 
   with all members of the irreducible set then it must be a multiple 
   of the identity. The proof of this starts from $[\gamma^R, M] = 0$ so that 
   $[\gamma^R, M - \lambda 1] = 0$. 

   But take $\lambda$ such that $\mathrm{Det}(M - \lambda 1) = 0$; then $M - \lambda 1$ picks out a subspace 
   unless it is null, so Schur's lemma is saying $M = \lambda 1$. 
   So now, if you have two matrices $S$ and $S'$, which both give the 
   equivalence 

       $\Gamma^R = S\gamma^R S^{-1} = S'\gamma^R S'^{-1}$

   then $S^{-1}S'\gamma^R S'^{-1}S = S^{-1}\Gamma^R S = \gamma^R$

       $\therefore\ [S^{-1}S', \gamma^R] = 0$
       $\therefore\ S^{-1}S' = \lambda 1$
       $\therefore\ S' = \lambda S$.


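5. Completeness. Consider the section above when $\Gamma^R = \gamma^R$ and therefore 
   $S = \lambda 1$. 

   Then $\lambda\delta_{\alpha\delta} = S_{\alpha\delta} = \epsilon_R(\gamma^R)_{\alpha\beta}F_{\beta\gamma}(\gamma^R)_{\gamma\delta}$
   and tracing $\Rightarrow 4\lambda = \epsilon_R(\gamma^R)_{\alpha\beta}F_{\beta\gamma}(\gamma^R)_{\gamma\alpha}$
       $= \epsilon_R(\gamma^R\gamma^R)_{\gamma\beta}F_{\beta\gamma}$
       $= 16\,\delta_{\gamma\beta}F_{\beta\gamma}$.
   Hence $\lambda = 4\delta_{\gamma\beta}F_{\beta\gamma}$ and substituting 

       $4\delta_{\alpha\delta}\delta_{\gamma\beta}F_{\beta\gamma} = \epsilon_R(\gamma^R)_{\alpha\beta}F_{\beta\gamma}(\gamma^R)_{\gamma\delta}$

   and this being true for arbitrary $F$ implies 

       $\sum_{R=1}^{16}\epsilon_R(\gamma^R)_{\alpha\beta}(\gamma^R)_{\gamma\delta} = 4\delta_{\alpha\delta}\delta_{\beta\gamma}$.    (4.18)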
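
Items 1 and 5 above can be checked by brute force. The sketch below is an illustration only: it assumes the representation $\gamma^0 = \tau_3 \times 1$, $\gamma^i = i\tau_2 \times \sigma^i$ and the ordering $1, \gamma^\mu, \sigma^{\mu\nu}, \sigma^{\mu 5}, \gamma^5$ for the 16 matrices; all function names are hypothetical.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def kron(a, b):
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

I2 = [[1, 0], [0, 1]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]
# Assumed representation: gamma^0 = tau_3 x 1, gamma^i = (i tau_2) x sigma^i.
GAMMA = [kron(SIGMA[2], I2)] + [kron([[0, 1], [-1, 0]], s) for s in SIGMA]

def basis16():
    """The 16 matrices 1, gamma^mu, sigma^{mu nu}, sigma^{mu 5}, gamma^5."""
    g = GAMMA
    g5 = matmul(matmul(g[0], g[1]), matmul(g[2], g[3]))
    basis = [identity(4)] + list(g)
    # sigma^{mu nu} = (i/2)[gamma^mu, gamma^nu] = i gamma^mu gamma^nu for mu != nu
    basis += [scale(1j, matmul(g[m], g[n]))
              for m in range(4) for n in range(m + 1, 4)]
    basis += [scale(1j, matmul(g[m], g5)) for m in range(4)]
    basis.append(g5)
    return basis

def signs(basis):
    """epsilon_R defined by (Gamma^R)^2 = epsilon_R 1."""
    eps = []
    for m in basis:
        sq = matmul(m, m)
        e = round(complex(sq[0][0]).real)
        assert e in (1, -1) and sq == scale(e, identity(4))
        eps.append(e)
    return eps

def trace_orthogonal(basis, eps):
    """Tr(Gamma^R Gamma^S) = 4 epsilon_R delta^{RS}: implies independence."""
    return all(trace(matmul(basis[r], basis[s])) == (4 * eps[r] if r == s else 0)
               for r in range(16) for s in range(16))

def complete(basis, eps):
    """Check (4.18) for every choice of the four matrix indices."""
    idx = range(4)
    return all(
        sum(eps[r] * basis[r][a][b] * basis[r][c][d] for r in range(16))
        == (4 if (a == d and b == c) else 0)
        for a in idx for b in idx for c in idx for d in idx)

B = basis16()
EPS = signs(B)
print(trace_orthogonal(B, EPS), complete(B, EPS))
```

The trace-orthogonality test is the numerical counterpart of the proof of item 1; the completeness test is a direct evaluation of (4.18).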


Lorentz Covariance of the Dirac Equation 


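Recall that the rotations and boosts are specified by the coordinates of the 
transformed point being given as 

    $x'^\mu = \Omega^\mu{}_\nu x^\nu$
    $\partial'_\mu = \frac{\partial}{\partial x'^\mu} = \Omega_\mu{}^\nu\partial_\nu$.

Now the Dirac equation reads $(i\gamma^\mu\partial_\mu - m)\psi(x) = 0$ and if $\psi \to \psi'$ under 
Lorentz transformation, then (examining at the transformed point for convenience) 
we had better have $(i\gamma^\mu\partial'_\mu - m)\psi'(x') = 0$ to ensure covariance. We 
can rewrite this as $(i\gamma^\mu\Omega_\mu{}^\nu\partial_\nu - m)\psi'(\Omega x) = 0$ and writing $\Gamma^\nu = \gamma^\mu\Omega_\mu{}^\nu$ we can 
insist that $\Gamma^\nu = S\gamma^\nu S^{-1}$ because 

    $\{\Gamma^\mu, \Gamma^\nu\} = \{\gamma^\rho\Omega_\rho{}^\mu, \gamma^\sigma\Omega_\sigma{}^\nu\}$
    $= \Omega_\rho{}^\mu\Omega_\sigma{}^\nu\,2g^{\rho\sigma}$
    $= 2g^{\mu\nu}$

and we have the same algebra and can apply the theorem. (Of course we do not 
yet know $S$.) We now have $(i\gamma^\mu\partial_\mu - m)S^{-1}\psi'(\Omega x) = 0$ since $S$ is nonsingular. 
So, to ensure covariance, we have 

    $\psi(x) \to \psi'(x) = S_\Omega\,\psi(\Omega^{-1}x)$    (4.19)

under Lorentz transformation. Any object that transforms in this manner is 
called a covariant four-component spinor. Of course, the $S_\Omega$ have to form a 
representation of the Lorentz group. 

Now, using Lie's theorems again, we restrict to the infinitesimal form $\Omega^\mu{}_\nu = \delta^\mu{}_\nu + \omega^\mu{}_\nu + \ldots$ 
and for the spinors we write 

    $\psi'_\alpha(\Omega x) = S_\alpha{}^\beta\,\psi_\beta(x)$
    $S_\alpha{}^\beta = \delta_\alpha{}^\beta - \frac{i}{4}\omega_{\mu\nu}(\Sigma^{\mu\nu})_\alpha{}^\beta + \ldots$    (4.20)

where the six $\Sigma^{\mu\nu}$ have to be constructed out of the 16 matrices $\Gamma^R$. 
Imposing 

    $\gamma^\mu\Omega_\mu{}^\lambda = S\gamma^\lambda S^{-1}$

implies that 

    $\gamma^\mu\omega_\mu{}^\lambda = -\frac{i}{4}\omega_{\mu\nu}[\Sigma^{\mu\nu}, \gamma^\lambda]$,

that is, 

    $\omega_{\mu\nu}\{[\Sigma^{\mu\nu}, \gamma^\lambda] - 2i[\gamma^\mu g^{\nu\lambda} - \gamma^\nu g^{\mu\lambda}]\} = 0$.

But the $\omega_{\mu\nu}$ are arbitrary, so that 

    $[\Sigma^{\mu\nu}, \gamma^\lambda] = 2i(\gamma^\mu g^{\nu\lambda} - \gamma^\nu g^{\mu\lambda})$    (4.21)

and from the tables $\Sigma^{\mu\nu} = \sigma^{\mu\nu}$ is a solution of this. Notice that for proper 
Lorentz transformations we require $\det S = 1$ so that the $\sigma^{\mu\nu}$, being traceless, 
are appropriate (i.e., do not take the solution $\sigma^{\mu\nu} + 1$). Also, if there is another 
solution, then the difference commutes with all $\gamma^\lambda$, hence commutes with all $\Gamma^R$, hence is 
a multiple of the identity. So we have the solution. 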
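
That $\Sigma^{\mu\nu} = \sigma^{\mu\nu}$ solves (4.21) can be confirmed without the product table. The sketch below assumes the metric $g = \mathrm{diag}(1,-1,-1,-1)$ and the representation $\gamma^0 = \tau_3 \times 1$, $\gamma^i = i\tau_2 \times \sigma^i$; the function names are illustrative.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def kron(a, b):
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def combine(c1, a, c2, b):
    """Linear combination c1*a + c2*b of two matrices."""
    return [[c1 * x + c2 * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def commutator(a, b):
    return combine(1, matmul(a, b), -1, matmul(b, a))

I2 = [[1, 0], [0, 1]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]
GAMMA = [kron(SIGMA[2], I2)] + [kron([[0, 1], [-1, 0]], s) for s in SIGMA]
G = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]  # g^{mu nu}

def sigma(m, n):
    """sigma^{mu nu} = (i/2)[gamma^mu, gamma^nu], as in (4.7)."""
    c = commutator(GAMMA[m], GAMMA[n])
    return [[0.5j * x for x in row] for row in c]

def solves_4_21():
    """Check [sigma^{mu nu}, gamma^lam] = 2i(gamma^mu g^{nu lam} - gamma^nu g^{mu lam})."""
    for m in range(4):
        for n in range(4):
            for lam in range(4):
                lhs = commutator(sigma(m, n), GAMMA[lam])
                rhs = combine(2j * G[n][lam], GAMMA[m],
                              -2j * G[m][lam], GAMMA[n])
                if lhs != rhs:
                    return False
    return True

print(solves_4_21())
```

All entries here are small Gaussian integers or half-integers, so the exact `==` comparison is safe.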
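The Adjoint 


We know that some of our $\Gamma^R$ are Hermitian and others anti-Hermitian. 
Notice, by inspection, that 

    $\bar{\Gamma}^R \equiv \gamma^0(\Gamma^R)^\dagger\gamma^0 = \Gamma^R$    (4.22)

which is why I have adopted the current conventions. In particular 

    $(\sigma^{\mu\nu})^\dagger = \gamma^0\sigma^{\mu\nu}\gamma^0$,

so that $S^\dagger = \gamma^0 S^{-1}\gamma^0$, 

since $S = 1 - \frac{i}{4}\omega_{\mu\nu}\sigma^{\mu\nu} + \ldots = \exp\left(-\frac{i}{4}\omega_{\mu\nu}\sigma^{\mu\nu}\right)$, 

by inserting lots of $1 = \gamma^0\gamma^0$ in between terms of the expansion. Another way 
of saying this is 

    $\bar{S} = \gamma^0 S^\dagger\gamma^0 = S^{-1}$    (4.23)

and we note that $S$ is unitary only if $\omega_{0i} = 0$, leaving only the rotations described 
by $\omega_{ij}$. (This is no surprise; we noted the pseudo-orthogonality before. This 
is a finite dimensional representation of a noncompact group.) 

It is now obvious how to construct covariant quantities from $\psi^\dagger$ and we 
also now see why we put indices on the constant $\gamma^\mu$ matrices. We introduce 
the adjoint spinor (contravariant) $\bar\psi$ by 

    $\bar\psi^\alpha = \psi^\dagger_\beta(\gamma^0)^{\beta\alpha}$ or $\bar\psi = \psi^\dagger\gamma^0$.    (4.24)

Then when 
    $\psi' = S\psi$
    $\psi'^\dagger = \psi^\dagger S^\dagger$
but $\bar\psi' = \psi'^\dagger\gamma^0 = \psi^\dagger S^\dagger\gamma^0$, 
that is, $\bar\psi' = \bar\psi\,(\gamma^0 S^\dagger\gamma^0) = \bar\psi S^{-1}$.    (4.25)
Now 

    $S\gamma^\mu S^{-1} = \gamma^\nu\Omega_\nu{}^\mu$, so 
    $\Omega^\mu{}_\nu\,S\gamma^\nu S^{-1} = \gamma^\mu$
    $\therefore\ S^{-1}\gamma^\mu S = \Omega^\mu{}_\nu\gamma^\nu$    (4.26)

and hence $\bar\psi\gamma^\mu\psi$ is a contravariant four-vector. 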
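
A short numerical illustration of (4.22) and (4.23), under stated assumptions: the representation $\gamma^0 = \tau_3 \times 1$, $\gamma^i = i\tau_2 \times \sigma^i$, and, as the example of a non-unitary $S$, a boost along $z$ written as $S = \cosh(\eta/2) + \gamma^0\gamma^3\sinh(\eta/2)$ (the closed form of the exponential when only $\omega_{03}$ is nonzero; the sign convention for $\eta$ does not affect the checks). Names are hypothetical.

```python
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def kron(a, b):
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def dagger(a):
    n = len(a)
    return [[a[j][i].conjugate() for j in range(n)] for i in range(n)]

def close(a, b, tol=1e-12):
    return all(abs(x - y) <= tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

I2 = [[1, 0], [0, 1]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]
GAMMA = [kron(SIGMA[2], I2)] + [kron([[0, 1], [-1, 0]], s) for s in SIGMA]

def dirac_adjoint(m):
    """gamma^0 m^dagger gamma^0, the bar operation of (4.22)."""
    return matmul(matmul(GAMMA[0], dagger(m)), GAMMA[0])

def basis16():
    g = GAMMA
    g5 = matmul(matmul(g[0], g[1]), matmul(g[2], g[3]))
    basis = [identity(4)] + list(g)
    basis += [scale(1j, matmul(g[m], g[n]))
              for m in range(4) for n in range(m + 1, 4)]
    basis += [scale(1j, matmul(g[m], g5)) for m in range(4)]
    return basis + [g5]

def boost_spinor(eta):
    """Assumed closed form of S for a boost along z with parameter eta."""
    g0g3 = matmul(GAMMA[0], GAMMA[3])
    return add(scale(math.cosh(eta / 2), identity(4)),
               scale(math.sinh(eta / 2), g0g3))

S = boost_spinor(0.7)
print(all(dirac_adjoint(m) == m for m in basis16()))    # (4.22)
print(close(matmul(dirac_adjoint(S), S), identity(4)))  # (4.23): S-bar = S^{-1}
print(close(matmul(dagger(S), S), identity(4)))         # unitarity fails for a boost
```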
This extends under space-reflection 

    $\mathbf{x} \to (-)\mathbf{x}$, $t \to t$, that is, $x^\mu \to \sum_\nu g^{\mu\nu}x^\nu$,    (4.27)

which you recall has to be adjoined to our proper Lorentz transformations, 
and we have 

    $\psi \to P\psi$ with $\bar\psi \to \bar\psi\bar{P} = \bar\psi P^{-1}$    (4.28)

exactly as earlier. This time $\Gamma^\mu = \sum_\nu g^{\mu\nu}\gamma^\nu = \gamma_\mu = P^{-1}\gamma^\mu P$, which has the 
(unitary) solution $P = e^{i\phi}\gamma^0$ and if $P^2 = 1$ then $e^{i\phi} = \pm 1$ so usually 

    $P = \gamma^0$    (4.29)

is the standard choice. 
We find that all the $\Gamma^R$ give us $\bar\psi\Gamma^R\psi$ to be real (Hermitian later when 
operators), with 

    $\bar\psi\psi$                   scalar 
    $\bar\psi\gamma^5\psi$           pseudoscalar 
    $\bar\psi\gamma^\mu\psi$         vector 
    $\bar\psi\sigma^{\mu 5}\psi$     pseudovector 
    $\bar\psi\sigma^{\mu\nu}\psi$    tensor 


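The Nonrelativistic Limit 


Suppose we go to the rest frame of the electron, so that 

    $i\frac{\partial\psi}{\partial t} = H\psi \to m\gamma^0\psi$    (4.30)

then we identify 

    $\psi = e^{-imt}\eta$ with $\gamma^0\eta = \eta$    (4.31)
    $\psi = e^{+imt}\eta$ with $\gamma^0\eta = (-)\eta$    (4.32)

as two sets of solutions. Notice that $\frac{1 + \gamma^0}{2}$ is a projection operator for the positive 
energy solutions and $\frac{1 - \gamma^0}{2}$ for the negative: 

    $P_+ + P_- = 1$
    $P_+P_- = 0 = P_-P_+$
    $P_+P_+ = P_+$
    $P_-P_- = P_-$.


There is a representation of the Dirac matrices where 

    $\beta = \tau_3 \times 1 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$    (4.33)
    $\boldsymbol{\alpha} = \tau_1 \times \boldsymbol{\sigma} = \begin{pmatrix} 0 & \boldsymbol{\sigma} \\ \boldsymbol{\sigma} & 0 \end{pmatrix}$    (4.34)

so that 

    $\gamma^0 = \beta = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$    (4.35)
    $\gamma^i = \beta\alpha^i = i\tau_2 \times \sigma^i = \begin{pmatrix} 0 & \sigma^i \\ -\sigma^i & 0 \end{pmatrix}$.    (4.36)


In this representation, $\frac{1 + \gamma^0}{2} = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}$ and $\frac{1 - \gamma^0}{2} = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}$ so that $\psi = \begin{pmatrix} \text{upper} \\ \text{lower} \end{pmatrix}$. We 
refer to upper and lower components. 
Now in the rest frame, $H = m\gamma^0$ commutes with 

    $\Sigma^i = \frac{1}{2}\epsilon^{ijk}\sigma^{jk} = \begin{pmatrix} \sigma^i & 0 \\ 0 & \sigma^i \end{pmatrix}$    (4.37)
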

therefore so do $\frac{1}{2}(1 \pm \Sigma^i)$. So we can further classify the solutions by using the 
projection operators. 
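
The projector relations and the commutation with the spin matrices can be verified directly. This is a minimal sketch assuming the representation (4.33) to (4.37); the names `P_PLUS`, `P_MINUS`, and `SPIN` are illustrative only.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def kron(a, b):
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def combine(c1, a, c2, b):
    return [[c1 * x + c2 * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def zero(n):
    return [[0] * n for _ in range(n)]

I2 = [[1, 0], [0, 1]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]
GAMMA0 = kron(SIGMA[2], I2)              # gamma^0 = tau_3 x 1

# Energy projectors (1 +/- gamma^0)/2.
P_PLUS = combine(0.5, identity(4), 0.5, GAMMA0)
P_MINUS = combine(0.5, identity(4), -0.5, GAMMA0)

# Spin matrices Sigma^i = diag(sigma^i, sigma^i) = 1 x sigma^i, as in (4.37).
SPIN = [kron(I2, s) for s in SIGMA]

def is_projector_pair(p, q):
    n = len(p)
    return (combine(1, p, 1, q) == identity(n)
            and matmul(p, q) == zero(n) and matmul(q, p) == zero(n)
            and matmul(p, p) == p and matmul(q, q) == q)

def commutes(a, b):
    return matmul(a, b) == matmul(b, a)

print(is_projector_pair(P_PLUS, P_MINUS))
print(all(commutes(GAMMA0, s) for s in SPIN))
# Spin projectors along z, (1 +/- Sigma^3)/2, split each energy sector in two.
S_UP = combine(0.5, identity(4), 0.5, SPIN[2])
S_DOWN = combine(0.5, identity(4), -0.5, SPIN[2])
print(is_projector_pair(S_UP, S_DOWN), commutes(S_UP, P_PLUS))
```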


Poincaré Group: Inhomogeneous Lorentz Group 


Restrict your attention to [4] of space and time. Write 


    $x^\mu = (x^0, x^1, x^2, x^3) = (t, x, y, z) = (t, \mathbf{x}) = (t, x^i)$    (4.38)

where $t$ is really $ct$ but $c$ is set equal to unity. The Lorentz transformations are 
characterized by 


5 (4.39) 


uv 


leaving invariant the quantity 

    $dt^2 - dx^2 - dy^2 - dz^2 = g_{\mu\nu}\,dx^\mu dx^\nu$    (4.40)

where $g_{00} = 1$, $g_{ij} = (-)\delta_{ij}$, $g_{0i} = 0$.    (4.41)


(Note: The relativists, who need $g_{\mu\nu}$ for a metric in curved space, use $\eta_{\mu\nu}$ for 
this.) 

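What are the consequences of leaving this second rank covariant metric 
tensor numerically invariant (form unchanged)? Well you can write this as 

    $g_{\mu\nu} = g_{\rho\lambda}\frac{\partial x'^\rho}{\partial x^\mu}\frac{\partial x'^\lambda}{\partial x^\nu}$    (4.42)

and if we differentiate with respect to $x^\kappa$ $\Rightarrow$ 

    $0 = g_{\rho\lambda}\left[\frac{\partial^2 x'^\rho}{\partial x^\kappa\partial x^\mu}\frac{\partial x'^\lambda}{\partial x^\nu} + \frac{\partial x'^\rho}{\partial x^\mu}\frac{\partial^2 x'^\lambda}{\partial x^\kappa\partial x^\nu}\right]$.    (4.43)

We rewrite this to emphasize the symmetry as 

    $0 = g_{\rho\lambda}\{S^\rho_{\kappa\mu}D_\nu{}^\lambda + D_\mu{}^\rho S^\lambda_{\kappa\nu}\}$    (4.44)

with $S^\rho_{\kappa\mu} = \partial^2 x'^\rho/\partial x^\kappa\partial x^\mu$ and $D_\mu{}^\rho = \partial x'^\rho/\partial x^\mu$, 
then switching $\mu \leftrightarrow \kappa$ and adding, and switching $\kappa \leftrightarrow \nu$ and subtracting $\Rightarrow$ 

    $0 = g_{\rho\lambda}[S^\rho_{\kappa\mu}D_\nu{}^\lambda + D_\mu{}^\rho S^\lambda_{\kappa\nu} + S^\rho_{\mu\kappa}D_\nu{}^\lambda + D_\kappa{}^\rho S^\lambda_{\mu\nu} - S^\rho_{\nu\mu}D_\kappa{}^\lambda - D_\mu{}^\rho S^\lambda_{\nu\kappa}]$    (4.45)

where the cancellations use the symmetry of both $g_{\rho\lambda}$ and $S^\rho_{\kappa\mu}$. We now have 

    $0 = 2g_{\rho\lambda}S^\rho_{\kappa\mu}D_\nu{}^\lambda$    (4.46)

but both $g_{\rho\lambda}$ and $D_\nu{}^\lambda$ are supposed to be nonsingular, so 

    $0 = S^\rho_{\kappa\mu} = \frac{\partial^2 x'^\rho}{\partial x^\kappa\partial x^\mu}$.    (4.47)

This is solved by having $x'$ be "linear and constant" so that 

    $x'^\mu = \Omega^\mu{}_\nu x^\nu + a^\mu$    (4.48)

where $\Omega^\mu{}_\nu$ and $a^\mu$ are constants. The constant metric condition now reads 

    $g_{\mu\nu} = g_{\rho\lambda}\Omega^\rho{}_\mu\Omega^\lambda{}_\nu$.    (4.49)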


These transformations form the inhomogeneous Lorentz group or Poincaré group. 
We now look at the subgroups of this group. The $a^\mu$ parts give translations; 
they form an abelian subgroup. 


Homogeneous (Later Restricted) Lorentz Group 
Dropping the translation terms we have 

    $x'^\mu = \Omega^\mu{}_\nu x^\nu$    (4.50)

where the $\Omega^\mu{}_\nu$ are just numbers, and real ones at that. Notice that the $x^\mu$ 
themselves now form a contravariant vector. We had 


Bi > Bi(D")i, (4.51) 
so that we identify 
Of sp s, (4.52) 


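Recall that we can raise and lower indices with the numerically invariant 
metric tensors $g_{\mu\nu}$ and $g^{\mu\nu}$ and notice that $g^{\mu\nu}$ has the same numerical entries 
as $g_{\mu\nu}$. Notice that 

    $\delta^\alpha{}_\beta = g_{\mu\beta}\,g^{\mu\alpha}$    (4.53)
    $= (g_{\rho\lambda}\Omega^\rho{}_\mu\Omega^\lambda{}_\beta)\,g^{\mu\alpha}$    by invariance of $g_{\mu\nu}$    (4.54)
    $= \Omega^\lambda{}_\beta\,(g_{\lambda\rho}\Omega^\rho{}_\mu g^{\mu\alpha})$    (no change; just a suggestive rewrite)    (4.55)
    $= \Omega^\lambda{}_\beta\,\Omega_\lambda{}^\alpha$    (4.56)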


where we have raised and lowered indices on the final symbol. (Note: The 
position of these indices (as first or last) now has to stay fixed.) But this can 
be rewritten 


a _ —1 a 
ôn e(D o (4.57) 
so we identify 
D% = Q5. (4.58) 


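The equation that expresses the fact that $g_{\mu\nu}$ is form invariant, which can also 
be viewed as a restriction on the allowed elements of $\Omega$ or $D$, can be rewritten 
in many ways: 

    $g_{\mu\nu} = g_{\rho\lambda}\Omega^\rho{}_\mu\Omega^\lambda{}_\nu$    (4.59)
or  $\Omega_\lambda{}^\mu\,\Omega^\lambda{}_\nu = \delta^\mu{}_\nu$    (4.60)
or  $\Omega^\mu{}_\rho\,\Omega^{\nu\rho} = g^{\mu\nu}$    (4.61)
or  $\Omega^\rho{}_\mu\,g_{\rho\lambda}\,\Omega^\lambda{}_\nu = g_{\mu\nu}$    (4.62)

expressing that it is pseudo-orthogonal. Note that (usually) when authors 
specify a matrix form they mean 

    $x'^\mu = \Omega^\mu{}_\nu x^\nu$    (4.63)

    $\begin{pmatrix} x'^0 \\ x'^1 \\ x'^2 \\ x'^3 \end{pmatrix} = \begin{pmatrix} \text{Matrix} \\ \text{Specified} \end{pmatrix}\begin{pmatrix} x^0 \\ x^1 \\ x^2 \\ x^3 \end{pmatrix}$    (4.64)

and if you slip up on this point, then you will get the wrong signs. 
Now consider the last form above as the matrix equation 

    $\Omega^T g\,\Omega = g$    (4.65)
and take the determinant: 
    $(\mathrm{Det}\,\Omega)^2 = 1$    (4.66)
    $\therefore\ \mathrm{Det}\,\Omega = \pm 1$.    (4.67)
If $\mathrm{Det}\,\Omega = 1$, we have proper Lorentz transformations and if $\mathrm{Det}\,\Omega = -1$, we 
have an improper one. 
Again, look at the (0, 0) component of the same form: 

    $\Omega^\rho{}_0\,g_{\rho\lambda}\,\Omega^\lambda{}_0 = 1$,    (4.68)

that is, 
    $(\Omega^0{}_0)^2 - \sum_i(\Omega^i{}_0)^2 = 1$.    (4.69)

Hence 
    $(\Omega^0{}_0)^2 \geq 1$.    (4.70)


If $\Omega^0{}_0 \geq 1$, we have an orthochronous transformation, and if $\Omega^0{}_0 \leq (-)1$, we 
have a nonorthochronous one. 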
There are four sectors: 


1. $L_+^\uparrow$ with $\Omega^0{}_0 \geq 1$ and $\det\Omega = 1$. Proper, orthochronous subset. These 
   transformations include 1, and clearly form a subgroup. We call this 
   the restricted homogeneous Lorentz group. It is characterized by six 
   parameters: three for rotations and three for boosts. 

2. $L_-^\uparrow$ with $\Omega^0{}_0 \geq 1$ and $\det\Omega = -1$. Improper, orthochronous subset. 
   Not a subgroup. Every $\Omega$ in this set can be written as a product of 
   one in $L_+^\uparrow$ with the space inversion 

       $\Omega(s) = \mathrm{diag}(1, -1, -1, -1)$, that is, $t \to t$, $\mathbf{x} \to (-)\mathbf{x}$.

3. $L_-^\downarrow$ with $\Omega^0{}_0 \leq (-)1$ and $\det\Omega = -1$. Improper and nonorthochronous. 
   Not a subgroup. Every $\Omega$ in this set can be written as a product of 
   one in $L_+^\uparrow$ with the time inversion 

       $\Omega(t) = \mathrm{diag}(-1, 1, 1, 1)$, that is, $t \to (-)t$, $\mathbf{x} \to \mathbf{x}$.

4. $L_+^\downarrow$ with $\Omega^0{}_0 \leq (-)1$ and $\det\Omega = 1$. Proper, nonorthochronous. Not a 
   subgroup. Every $\Omega$ in this set can be written as a product of one in 
   $L_+^\uparrow$ with the space-time inversion 

       $\Omega(st) = \Omega(s)\Omega(t) = (-)1 = \mathrm{diag}(-1, -1, -1, -1)$, i.e., $t \to (-)t$, $\mathbf{x} \to (-)\mathbf{x}$.    (4.73)
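
The four sectors can be told apart mechanically from $\det\Omega$ and the sign of $\Omega^0{}_0$. The sketch below is an illustration under the conventions of this section (metric $\mathrm{diag}(1,-1,-1,-1)$, matrix acting on the column $(x^0, x^1, x^2, x^3)$); the function names and the returned labels are hypothetical.

```python
import math

METRIC = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(a):
    return [list(col) for col in zip(*a)]

def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))

def is_lorentz(omega, tol=1e-9):
    """Pseudo-orthogonality: Omega^T g Omega = g, as in (4.65)."""
    lhs = matmul(matmul(transpose(omega), METRIC), omega)
    return all(abs(lhs[i][j] - METRIC[i][j]) <= tol
               for i in range(4) for j in range(4))

def sector(omega):
    """Label the sector from det(Omega) and the sign of Omega^0_0."""
    if not is_lorentz(omega):
        raise ValueError("matrix is not pseudo-orthogonal")
    proper = "proper" if det(omega) > 0 else "improper"
    ortho = "orthochronous" if omega[0][0] > 0 else "nonorthochronous"
    return proper + ", " + ortho

def boost_z(theta):
    """Boost along z with rapidity theta (v = tanh theta)."""
    ch, sh = math.cosh(theta), math.sinh(theta)
    return [[ch, 0, 0, sh], [0, 1, 0, 0], [0, 0, 1, 0], [sh, 0, 0, ch]]

SPACE_INV = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
TIME_INV = [[-1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]

for name, om in [("boost", boost_z(0.5)),
                 ("space inversion x boost", matmul(SPACE_INV, boost_z(0.5))),
                 ("time inversion", TIME_INV),
                 ("space-time inversion", matmul(SPACE_INV, TIME_INV))]:
    print(name, "->", sector(om))
```

Because $(\Omega^0{}_0)^2 \geq 1$ for any pseudo-orthogonal matrix, testing only the sign of $\Omega^0{}_0$ is enough to decide orthochronicity.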


Notes 


1. We usually work with $L_+^\uparrow$ and handle the reflections separately. 
2. Vectors have an invariant length under $L$. We call a vector 

   time-like if $x^2 > 0$ (e.g., energy and momentum of a free massive 
   particle) 

   space-like if $x^2 < 0$ (e.g., momentum transfer) 

   null or light-like if $x^2 = 0$ (e.g., energy-momentum of a massless 
   photon) 


3. Under $L^\uparrow$ the sign of the time component is invariant as well as the 
   length of the vector. Then both $\theta(k_0)$ and $\delta(k^2 - m^2)$ are invariants. 

   [Figure 4.2: the future (positive) and past (negative) light cones of P; the slope is 1/c = 1 and $x_P$ is taken as origin.] 

   FIGURE 4.2 
   The light cone of P. 

   The invariant volume-element in momentum space is 

       $\frac{d^4k}{(2\pi)^4}\,2\pi\,\theta(k_0)\,\delta(k^2 - m^2) = \frac{d^4k}{(2\pi)^4}\,2\pi\,\theta(k_0)\,\delta\big([k_0 + \sqrt{\mathbf{k}^2 + m^2}][k_0 - \sqrt{\mathbf{k}^2 + m^2}]\big)$    (4.74)

       $= \frac{d^3k}{(2\pi)^3\,2E_k}$ where $E_k = +\sqrt{\mathbf{k}^2 + m^2}$.    (4.75)

   Notice the $2E_k$ particles per unit volume (a relativistically covariant 
   statement) and remember to have $(2\pi)^4$ and $2\pi\delta$ to get the factors 
   correct. 

4. Light cone of P is the set of points such that $(x - x_P)^2 = 0$. See 
   Figure 4.2. 


5. Obviously, with the structure of A-D, a Lorentz transformation is 
orthochronous if and only if it transforms every positive time-like 
vector into another such. 


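6. Watch the signs! 

       $p^\mu = (p^0, p^1, p^2, p^3) = (E, \mathbf{p})$    (4.76)
       $p_\mu = (E, (-)\mathbf{p}) = (p_0, p_1, p_2, p_3)$    (4.77)
   (Authors vary.) 
       $p^2 = p_\mu p^\mu = p^\mu p^\nu g_{\mu\nu} = E^2 - \mathbf{p}^2\ (= m^2)$    (4.78)

   $x^\mu = (t, \mathbf{x})$ and $x_\mu = (t, -\mathbf{x})$ so that 

       $p\cdot x = p^\mu x_\mu = Et - \mathbf{p}\cdot\mathbf{x}$    (4.79)

       $\partial_\mu = \frac{\partial}{\partial x^\mu} = \left(\frac{\partial}{\partial t}, +\nabla\right)$ but it is covariant    (4.80)
   while 
       $\partial^\mu = \left(\frac{\partial}{\partial t}, -\nabla\right)$ but it is contravariant.    (4.81)
   (Really we are mixing vectors and forms.) 
   Then 
       $\partial_\mu V^\mu = \frac{\partial V^0}{\partial t} + \nabla\cdot\mathbf{V}$    (4.82)

   however, the D'Alembertian is 

       $\Box = \partial_\mu\partial^\mu = \frac{\partial^2}{\partial t^2} - \nabla^2$.    (4.83)

   In the Schrödinger representation, with $\hbar = 1 = c$ 

       $E \to i\frac{\partial}{\partial t}$    (4.84)
       $\mathbf{p} \to (-)i\nabla$    (4.85)
       $p_\mu = i\partial_\mu = i\left(\frac{\partial}{\partial t}, +\nabla\right)$    (4.86)
   and $p^\mu = i\partial^\mu = i\left(\frac{\partial}{\partial t}, -\nabla\right)$.    (4.87)
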

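7. Rotation subgroup 

       $\Omega^\mu{}_\nu \to \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & & & \\ 0 & & M & \\ 0 & & & \end{pmatrix}$ with $MM^T = 1$.

8. Boosts 

       $dx'^\mu = \Omega^\mu{}_\nu\,dx^\nu$.    (4.88)

   Here $d\mathbf{x} = 0$ and $dx'$ and $dy'$ are zero so 

       $dz' = \Omega^3{}_0\,dt$    (4.89)
   and $dt' = \Omega^0{}_0\,dt$.    (4.90)
   Now $v = \frac{dz'}{dt'} \Rightarrow \Omega^3{}_0 = v\,\Omega^0{}_0$.    (4.91)

   [Figure 4.3: the point P at relative rest in one frame, and P now at $x'$, moving with velocity $v$ along the $z$-axis, in the other.] 

   FIGURE 4.3 
   P at relative rest. P now at $x'$. 

   But recall that 

       $(\Omega^0{}_0)^2 - (\Omega^3{}_0)^2 = 1$
       $\Omega^0{}_0 \geq 1$ for $L^\uparrow$.

   The solution is 

       $\Omega^0{}_0 = \cosh\theta$
       $\Omega^3{}_0 = \sinh\theta$    with $v = \tanh\theta$,

   that is, $z' = z\cosh\theta + t\sinh\theta$
            $t' = t\cosh\theta + z\sinh\theta$.

   Often we call 

       $\beta = v$ ($= v/c$ of course)
       $\gamma = \frac{1}{\sqrt{1 - \beta^2}}$

   so that 

       $z' = \gamma(z + \beta t)$
       $t' = \gamma(t + \beta z)$.    (4.98)

9. A moment's thought should convince you that a general $\Omega$ in $L_+^\uparrow$ can 
   be written in the form 

       $\Omega = \Omega(R_\beta)\,\Omega(L_3)\,\Omega(R_\alpha)$.    (4.99)

   (a) Rotate the z-axes parallel. 
   (b) Boost to the desired velocity magnitude. 
   (c) Rotate to final direction. 

   Note: There is always ambiguity, and conventions are needed. 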
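
The boost parametrization in note 8 can be illustrated numerically: with $v = \tanh\theta$ one has $\gamma = \cosh\theta$ and $\gamma\beta = \sinh\theta$, and successive boosts along the same axis simply add their $\theta$'s. The following sketch assumes the matrix convention of (4.64); the helper names are illustrative.

```python
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def close(a, b, tol=1e-12):
    return all(abs(x - y) <= tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def boost_z(theta):
    """Omega for t' = t cosh(theta) + z sinh(theta), z' = z cosh(theta) + t sinh(theta)."""
    ch, sh = math.cosh(theta), math.sinh(theta)
    return [[ch, 0, 0, sh], [0, 1, 0, 0], [0, 0, 1, 0], [sh, 0, 0, ch]]

def gamma_factor(v):
    return 1.0 / math.sqrt(1.0 - v * v)

def compose_velocities(v1, v2):
    """Velocity of the combined boost: the parameters theta add."""
    return math.tanh(math.atanh(v1) + math.atanh(v2))

v1, v2 = 0.6, 0.7
t1, t2 = math.atanh(v1), math.atanh(v2)

# gamma = cosh(theta) and gamma*beta = sinh(theta)
print(abs(gamma_factor(v1) - math.cosh(t1)) < 1e-12,
      abs(gamma_factor(v1) * v1 - math.sinh(t1)) < 1e-12)

# Two boosts along z compose into one boost with theta1 + theta2 ...
print(close(matmul(boost_z(t1), boost_z(t2)), boost_z(t1 + t2)))

# ... which reproduces the usual velocity-addition formula.
print(abs(compose_velocities(v1, v2) - (v1 + v2) / (1 + v1 * v2)) < 1e-12)
```

The combined velocity stays below 1 for any $|v_1|, |v_2| < 1$, since $\tanh$ is bounded.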
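The Poincaré Algebra 
Recall that $x'^\mu = \Omega^\mu{}_\nu x^\nu + a^\mu$ and work with the infinitesimal form 

    $\Omega^\mu{}_\nu = \delta^\mu{}_\nu + \omega^\mu{}_\nu$
    $a^\mu = \varepsilon^\mu$    (4.100)

so that the constraint equation 

    $g_{\mu\nu} = g_{\rho\lambda}\Omega^\rho{}_\mu\Omega^\lambda{}_\nu$

yields 

    $g_{\mu\nu} = g_{\rho\lambda}[\delta^\rho{}_\mu + \omega^\rho{}_\mu][\delta^\lambda{}_\nu + \omega^\lambda{}_\nu]$
    $= g_{\mu\nu} + \omega_{\nu\mu} + \omega_{\mu\nu} + \ldots$

and thus 

    $\omega_{\mu\nu} = (-)\omega_{\nu\mu}$    (4.101)

follows at once. At this stage it is easy to see that $\varepsilon^\mu$ and $\omega_{\mu\nu}$ give you 10 real 
parameters. 

Now expand an element of the group as 

    $g = 1 - \frac{i}{2}\omega_{\alpha\beta}M^{\alpha\beta} + i\varepsilon_\mu P^\mu$    (4.102)

where $M^{\alpha\beta}$ is antisymmetric. The operators $M^{\alpha\beta}$ and $P^\mu$ are generators. 
(Notice the sign in the last term. If we wish to view $\varepsilon^\mu = (\varepsilon^0, \boldsymbol{\varepsilon})$ as the shifts, 
then $g \approx 1 + i\varepsilon^0P^0 - i\boldsymbol{\varepsilon}\cdot\mathbf{P} = 1 - \varepsilon^0\partial_0 - \boldsymbol{\varepsilon}\cdot\nabla$ as required for $\psi(x) \to g\psi(x) = \psi(T^{-1}x)$ 
as required in the Schrödinger picture.) 

Now we have 

    $x'^\mu = x^\mu + \omega^\mu{}_\nu x^\nu + \varepsilon^\mu$    (4.103)
    $= x^\mu + \omega_{\alpha\beta}\,g^{\alpha\mu}x^\beta + \varepsilon_\alpha g^{\alpha\mu}$    (4.104)

    $\delta x^\mu = \frac{1}{2}\omega_{\alpha\beta}[g^{\alpha\mu}x^\beta - g^{\beta\mu}x^\alpha] + \varepsilon_\alpha g^{\alpha\mu}$    (4.105)

by using the $\omega_{\alpha\beta}$ antisymmetry. Hence, from this defining representation we 
identify (applying $g$ to the wave function, not to the coordinate): 

    $P^\mu = i\partial^\mu$
    $M^{\alpha\beta} = i(x^\alpha\partial^\beta - x^\beta\partial^\alpha)$.    (4.106)

Note again the remark on signs made previously. 
We can then work out the algebra directly: 

    $[P^\mu, P^\nu] = 0$.    (4.107)

Again: 

    $[M^{\mu\nu}, P^\rho] = [i(x^\mu\partial^\nu - x^\nu\partial^\mu), i\partial^\rho]$
    $= g^{\mu\rho}\partial^\nu - g^{\nu\rho}\partial^\mu$
    $= i(g^{\nu\rho}P^\mu - g^{\mu\rho}P^\nu)$

and hence 

    $[M^{\mu\nu}, P^\rho] = i(g^{\nu\rho}P^\mu - g^{\mu\rho}P^\nu)$.    (4.108)

Finally 

    $[M^{\mu\nu}, M^{\rho\sigma}] = (i)(i)[x^\mu\partial^\nu - x^\nu\partial^\mu, x^\rho\partial^\sigma - x^\sigma\partial^\rho]$
    $= -\{x^\mu g^{\nu\rho}\partial^\sigma - x^\rho g^{\sigma\mu}\partial^\nu - x^\mu g^{\nu\sigma}\partial^\rho + x^\sigma g^{\rho\mu}\partial^\nu$
    $\quad - x^\nu g^{\mu\rho}\partial^\sigma + x^\rho g^{\sigma\nu}\partial^\mu + x^\nu g^{\mu\sigma}\partial^\rho - x^\sigma g^{\rho\nu}\partial^\mu\}$
    $= g^{\nu\rho}(x^\sigma\partial^\mu - x^\mu\partial^\sigma) + g^{\mu\sigma}(x^\rho\partial^\nu - x^\nu\partial^\rho)$
    $\quad + g^{\nu\sigma}(x^\mu\partial^\rho - x^\rho\partial^\mu) + g^{\mu\rho}(x^\nu\partial^\sigma - x^\sigma\partial^\nu)$
    $= i\{g^{\nu\rho}M^{\mu\sigma} + g^{\mu\sigma}M^{\nu\rho} - g^{\nu\sigma}M^{\mu\rho} - g^{\mu\rho}M^{\nu\sigma}\}$

so that $[M^{\mu\nu}, M^{\rho\sigma}] = i(M^{\mu\sigma}g^{\nu\rho} - M^{\mu\rho}g^{\nu\sigma} + M^{\nu\rho}g^{\mu\sigma} - M^{\nu\sigma}g^{\mu\rho})$.    (4.109)


These three sets of commutators characterize the Poincaré algebra. 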
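
The homogeneous part of the algebra can be tested in a finite-dimensional representation. The sketch below is an illustration, not the derivation used in the text: it takes $M^{\mu\nu} \to \frac{1}{2}\sigma^{\mu\nu} = \frac{i}{4}[\gamma^\mu, \gamma^\nu]$ (the spinor representation, compare Problem 4.13) and checks the commutator of two $M$'s with the index structure $i(M^{\mu\sigma}g^{\nu\rho} - M^{\mu\rho}g^{\nu\sigma} + M^{\nu\rho}g^{\mu\sigma} - M^{\nu\sigma}g^{\mu\rho})$ for all 256 index combinations. The representation of the $\gamma$'s and all names are assumptions.

```python
from itertools import product

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def kron(a, b):
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def add(*ms):
    return [[sum(vals) for vals in zip(*rows)] for rows in zip(*ms)]

def commutator(a, b):
    return add(matmul(a, b), scale(-1, matmul(b, a)))

I2 = [[1, 0], [0, 1]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]
GAMMA = [kron(SIGMA[2], I2)] + [kron([[0, 1], [-1, 0]], s) for s in SIGMA]
G = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]  # g^{mu nu}

# Spinor-representation generators M^{mu nu} = (i/4)[gamma^mu, gamma^nu].
M = [[scale(0.25j, commutator(GAMMA[m], GAMMA[n])) for n in range(4)]
     for m in range(4)]

def lorentz_algebra_holds():
    for m, n, r, s in product(range(4), repeat=4):
        lhs = commutator(M[m][n], M[r][s])
        rhs = scale(1j, add(scale(G[n][r], M[m][s]),
                            scale(-G[n][s], M[m][r]),
                            scale(G[m][s], M[n][r]),
                            scale(-G[m][r], M[n][s])))
        if lhs != rhs:
            return False
    return True

print(lorentz_algebra_holds())
# M^{12} is the z-component of the spin: diag(1/2, -1/2, 1/2, -1/2).
print([M[1][2][i][i] for i in range(4)])
```

The same loop could be run on any other candidate set of matrices to test whether they represent the Lorentz algebra with these sign conventions.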


The Casimir Operators and the States 


I am sure that you already know that the particles are characterized by mass 
and spin. Our first task is to exhibit these features. We start with the observation 
that 

    $P^2 = P_\mu P^\mu$    (4.110)

is a Casimir operator. Any Casimir for this problem has to be a homogeneous 
Lorentz scalar (or pseudoscalar) and thus be characterized by having no free 
indices. Then it must commute with all the $P^\mu$, but this is obvious here. 
Now what about other things that commute with the $P^\mu$? Well, recall that 

    $[M^{\mu\nu}, P^\rho] = i(g^{\nu\rho}P^\mu - g^{\mu\rho}P^\nu)$

so that if we define the Pauli-Lubanski pseudovector by 

    $W^\mu = \frac{1}{2}\epsilon^{\mu\nu\rho\sigma}P_\nu M_{\rho\sigma}$    (4.111)

where the Levi-Civita tensor is antisymmetric as before but with 

    $\epsilon^{0123} = (-)1$
    $\epsilon_{0123} = 1$

then $[W^\mu, P^\nu] = 0$.    (4.112)

This follows from the abelian nature of the $P^\mu$ subalgebra and the antisymmetry 
of $\epsilon^{\mu\nu\rho\sigma}$. 

Note: The minus sign in the relative numerical values of $\epsilon^{0123}$ and $\epsilon_{0123}$ is 
inevitable as a result of raising and lowering. You also now have to have 

    $\epsilon_{\mu\nu\rho\sigma}\epsilon^{\alpha\beta\gamma\delta} = (-)\begin{vmatrix} \delta^\alpha_\mu & \delta^\alpha_\nu & \delta^\alpha_\rho & \delta^\alpha_\sigma \\ \delta^\beta_\mu & \delta^\beta_\nu & \delta^\beta_\rho & \delta^\beta_\sigma \\ \delta^\gamma_\mu & \delta^\gamma_\nu & \delta^\gamma_\rho & \delta^\gamma_\sigma \\ \delta^\delta_\mu & \delta^\delta_\nu & \delta^\delta_\rho & \delta^\delta_\sigma \end{vmatrix}$    (4.113)

where the "extra" minus sign reflects the same remark. 

Now to return to our task, all we have to do is to make a homogeneous 
Lorentz scalar by contracting indices. Obviously $W_\mu P^\mu = 0$ so the remaining 
choice is $W^2 = W_\mu W^\mu$ and we adopt this as our second Casimir. 

Now physics motivates us to label the states by all four components of the 
momentum. So the rest of the complete commuting set of observables (CCSO) 
must be in the little group of $P^\mu$ and thus among the $W^\mu$. (Detailed enumeration 
shows there is nothing else. Exercise: Try this for $P^\mu = (m, 0, 0, 0)$.) 

But what are the commutation relations for the $W^\mu$ components? Well the 
$W^\mu$ form a (pseudo)vector by construction so 

    $[M^{\mu\nu}, W^\rho] = i[g^{\nu\rho}W^\mu - g^{\mu\rho}W^\nu]$    (4.114)

simply by comparison with the $P^\mu$ transformation law. Again 

    $[W^\mu, W^\nu] = \left[\frac{1}{2}\epsilon^{\mu\alpha\beta\gamma}P_\alpha M_{\beta\gamma}, W^\nu\right]$
    $= \frac{1}{2}\epsilon^{\mu\alpha\beta\gamma}P_\alpha[M_{\beta\gamma}, W^\nu] + \text{zero}$
    $= \frac{1}{2}\epsilon^{\mu\alpha\beta\gamma}P_\alpha\,i(W_\beta\delta^\nu_\gamma - W_\gamma\delta^\nu_\beta)$
    $= \frac{1}{2}i(\epsilon^{\mu\alpha\beta\nu}P_\alpha W_\beta - \epsilon^{\mu\alpha\nu\gamma}P_\alpha W_\gamma)$

so that $[W^\mu, W^\nu] = i\epsilon^{\mu\nu\alpha\beta}P_\alpha W_\beta$.    (4.115)


On the face of it we are now generating an infinite dimensional algebra, but 
provided that these $W^\mu$ act always on states of defined momentum we have 
effective closure. The precise little group formed by the $W^\mu$ depends upon 
the momenta. There are two cases of physical interest. 

    $P_\mu|p_\mu\rangle = p_\mu|p_\mu\rangle$ with $p^2 = m^2$.    (4.116)

We go to the rest frame $p_\mu = (m, 0, 0, 0)$ so that the effective commutators are: 

    $[W^i, W^j] = i\epsilon^{ij0k}P_0W_k$    ($W^0 = 0$)    (4.117)
    $= im\,\epsilon^{ij0k}W_k$    (4.118)
    $= im\,\epsilon^{ijk}W^k$    (4.119)
or 
    $\left[\frac{W^i}{m}, \frac{W^j}{m}\right] = i\epsilon^{ijk}\frac{W^k}{m}$    (4.120)

which we identify as an angular momentum algebra; this is called an $SU_2$ 
algebra. Now we can define 

    $S^i = \left.\frac{W^i}{m}\right|_{\text{rest frame}}$    (4.121)
so that 
    $W^2 = (-)m^2\mathbf{S}^2$,    (4.122)

and the final projection of states is 

    $|m, (-)m^2 s(s+1);\ \mathbf{p}, s_3\rangle$    (4.123)

as you probably expected. 
    $P_\mu|p_\mu\rangle = p_\mu|p_\mu\rangle$ with $p^2 = 0$.    (4.124)

We go to the frame where $p^\mu = (E, 0, 0, E)$ and from the definition of $W^\mu$, 
using $W^\mu p_\mu = 0$ we see that 

    $W^\mu = (W^3, W^1, W^2, W^3)$    (4.125)

in this frame. The effective commutation relations are then 

    $[W^i, W^j] = i\epsilon^{ij0k}P_0W_k + i\epsilon^{ij30}P_3W_0$    (4.126)
    $= i\epsilon^{ij0k}E((-)W^k) + i(-)\epsilon^{ij03}((-)E)(W^0)$    (4.127)
    $= iE\epsilon^{ijk}W^k - iE\epsilon^{ij3}W^0$    (4.128)
    $= iE[\epsilon^{ijk}W^k - \epsilon^{ij3}W^0]$    (4.129)

and written out in more detail these become 

    $[W^1, W^2] = 0$    (4.130)
    $\left[\frac{W^3}{E}, W^1\right] = iW^2$    (4.131)
    $\left[\frac{W^3}{E}, W^2\right] = (-)iW^1$,    (4.132)

which we recognize as "translations in the (x, y) plane together with rotations 
about the z-axis" by the identifications 

    $W^1 = \tilde{P}^1$    (4.133)
    $W^2 = \tilde{P}^2$, do not confuse with $P^1$ and so forth    (4.134)
    $\frac{W^3}{E} = \tilde{M}^{12}$    (4.135)
so that 
    $[\tilde{P}^1, \tilde{P}^2] = 0$    (4.136)
    $[\tilde{M}^{12}, \tilde{P}^1] = i\tilde{P}^2$    (4.137)
    $[\tilde{M}^{12}, \tilde{P}^2] = (-)i\tilde{P}^1$    (4.138)

compares directly with our standard form for "Poincaré." This is called the 
Euclidean group in [2]. 

There are two quantum numbers to be assigned. We all wave our hands at 
this point and really appeal to the physics we observe. You set the eigenvalues 
of $\tilde{P}^1$ and $\tilde{P}^2$ to zero (you might have expected continuous labels at this point), 
which leaves $\tilde{M}^{12}$ free to have an eigenvalue whereas this would not have 
been the case otherwise. With this in mind you note effectively now 

    $W^\mu = \lambda P^\mu$    (4.139)

which is a covariant equation, since $P^\mu$ is a vector and $W^\mu$ is a (pseudo)vector, 
provided that $\lambda$ is a (pseudo)scalar. We call $\lambda$ the helicity. If $n_\mu$ is an arbitrary 
vector, with $n_\mu P^\mu \neq 0$, then 

    $\lambda = \frac{n_\mu W^\mu}{n_\nu P^\nu} = \frac{\epsilon^{\mu\nu\rho\sigma}n_\mu P_\nu M_{\rho\sigma}}{2\,n_\nu P^\nu}$    (4.140)

and if we take it along the time direction, then 

    $\lambda = \frac{1}{2P^0}\epsilon^{0ijk}P_iM_{jk}$    (4.141)
    $= \frac{1}{|\mathbf{P}|}\,\frac{1}{2}\epsilon^{ijk}P^iM^{jk}$.    (4.142)

Thus, defining 

    $J^i = \frac{1}{2}\epsilon^{ijk}M^{jk}$    (4.143)

you have 

    $\lambda = \frac{\mathbf{P}\cdot\mathbf{J}}{|\mathbf{P}|}$

and we speak of the component of the angular momentum or spin along the 
three-momentum. 

Now, so far you have added only one quantum number (with a couple of 
zeros to specify the class of representations). When the reflections are added, 
you connect opposite helicities and the states are finally labeled 


(4.144) 


Im? >= |0 |À]; ps 
Problems 


4.1 Work through from Equation (4.1) to Equation (4.5) for yourself. 

42 Work out oa". 

43 Work out o/^y?, 

44 Using the Levita-Civita identities, establish y"y^y* = e"PY^y, y? + 
yg co reu qoe 

45 Work out y^o^^, 

4.6 Work out y. Hint: Use the ¢ form of y? and then y^ y, = 4. 

47 Work out oo”. 


48 Work out y"o^*. 
4.9 Work out oo. Hint: Multiply by y^o*? from the left by y^. 


4.10 Establish several matrix representations and check against text- 
books. 


4.11 Establish several o x t forms where o and t are Pauli matrices, for 
the gamma matrices. 


4.12 (For the devoted only.) Try all the previous systematically with s 
space and t time indices. 


4.13 Using the gamma matrix multiplication table show that o'"" do rep- 
resent the Lorentz algebra. 


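4.14 Show that $\partial_\mu V^\mu = \frac{\partial V^0}{\partial t} + \nabla\cdot\mathbf{V}$. 


4.15 Show that $\Box = \partial_\mu\partial^\mu = \frac{\partial^2}{\partial t^2} - \nabla^2$. 
4.16 Confirm for yourself that $[M^{\mu\nu}, P^\rho] = i(g^{\nu\rho}P^\mu - g^{\mu\rho}P^\nu)$. 


4.17 Confirm for yourself that $[M^{\mu\nu}, M^{\rho\sigma}] = i(M^{\mu\sigma}g^{\nu\rho} - M^{\mu\rho}g^{\nu\sigma} + M^{\nu\rho}g^{\mu\sigma} - M^{\nu\sigma}g^{\mu\rho})$. 


4.18 Confirm for yourself that $[M^{\mu\nu}, W^\rho] = i[g^{\nu\rho}W^\mu - g^{\mu\rho}W^\nu]$. 
4.19 Confirm for yourself that $[W^\mu, W^\nu] = i\epsilon^{\mu\nu\alpha\beta}P_\alpha W_\beta$. 


4.20 Confirm for yourself that if $s^2 = \mathbf{s}\cdot\mathbf{s} = 1$, where $\mathbf{s}$ is called a "spin-polarization 
vector," then $\frac{1}{2}(1 + \mathbf{s}\cdot\boldsymbol{\sigma})$ and $\frac{1}{2}(1 - \mathbf{s}\cdot\boldsymbol{\sigma})$, where $\boldsymbol{\sigma}$ are the 
Pauli matrices, each square to themselves while their 
product in either order is zero. These are called projection operators. 
They and many of their generalizations are very convenient for 
doing calculations. 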


5 


The Internal Symmetries 


This chapter deals with the internal symmetries of the standard model of 
elementary particle physics. We start by considering global symmetries and 
the associated local conservation laws. We have already met some of this 
structure before in the form of U(1) and SU(2) except that they now act on 
the fields that create and annihilate the elementary particles rather than on 
space-time itself. We take a global U(1) represented by 


    $U(1) = \exp(i\theta N)$    (5.1)


where $\theta$ is a constant parameter and N is an operator in an internal space 
where its eigenvalues are the numbers associated with the physical states of 
the system or the quantum fields that create and annihilate them. This allows 
quantum numbers (e.g., charges) to be assigned to the states and fields. Now 
if $\theta$ is indeed a constant then U(1) will produce global symmetries, which do 
not vary from place to place, or time to time, and so forth. The consequence 
of this is that there are local conservation laws, which force certain labels 
on the states and fields to either stay constant or to only change such that 
certain specific combinations are constant. This puts strict conditions on the 
interactions between the states. 

It may well be that you have no trouble in understanding what has just 
been presented. On the other hand, if you have no previous experience, it 
may be meaningless to you. With this in mind I offer two simple examples, 
quite unrelated to the physics, in the hope that they may help. 

The first one consists of a puzzle made up from the 62 remaining squares 
of a familiar chess or draughts board when two diagonally opposite corner 
squares are removed. You are also given 31 dominoes, each two squares by one 
in size in terms of the chessboard. The total areas of the mutilated chessboard 
and the full set of dominoes are equal. The question is whether you can cover 
the one object with the other without further mutilations of either. To solve 
this puzzle consider placing a domino over two adjacent squares on the board. 
Clearly it covers two squares of different colors. Again consider the color of 
one missing corner of the board and its relationship to the diagonally opposite 
one. Clearly, they are of the same color. So the task cannot be done. A discrete 
symmetry has answered a "state of the system" problem. 
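
The counting argument can be spelled out in a few lines. This is a small sketch of the color count only (a necessary condition for a tiling, not a tiling search); the function names are illustrative.

```python
def color_counts(removed):
    """Count the remaining squares of each color on an 8 x 8 board.

    Squares are (row, column) pairs; the color of a square is (row + column) % 2.
    """
    counts = {0: 0, 1: 0}
    for row in range(8):
        for col in range(8):
            if (row, col) not in removed:
                counts[(row + col) % 2] += 1
    return counts

def passes_color_test(removed):
    """Each domino covers one square of each color, so the counts must match."""
    counts = color_counts(removed)
    return counts[0] == counts[1]

opposite_corners = {(0, 0), (7, 7)}
adjacent_corners = {(0, 0), (0, 7)}

print(color_counts(opposite_corners), passes_color_test(opposite_corners))
print(color_counts(adjacent_corners), passes_color_test(adjacent_corners))
```

With diagonally opposite corners removed the counts are 30 and 32, so 31 dominoes cannot cover the board; removing two corners on the same edge leaves 31 of each color and the test gives no obstruction.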
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The next puzzle is quite different. You are given two identical wine glasses, 
one half full of red wine and the other half full of white wine. You also have 
a small ladle (or spoon). This puzzle comes in several stages. First take a 
spoonful from the red wine and transfer it into the white wine. Question: Is 
the concentration of white in the red glass greater than the concentration of 
the red in the white glass or are they equal? You should easily see that they 
are equal. (Try doing this as a piece of algebra; it is quite easy.) 

Now the process is repeated except that after the red wine is put into the 
white wine glass it is stirred with the spoon. Then the process is completed 
as before. What is your answer now? It is the same. (Try the algebra. It is 
harder but still possible.) Obviously something "deep" is going on. There 
is a conserved (unchanging) quantity involved. It is the difference between 
the amount of white wine transferred overall and the amount of red wine 
transferred overall in the opposite direction. This is clearly zero. You can 
work this out by algebra yet again, or observe that the initial and final levels in 
the two glasses are the same. This time a (continuous) symmetry has ensured 
that the result is unchanged. 
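
For readers who prefer to see the bookkeeping, here is a small sketch of the stirred case using exact fractions. The glass volume (100 units), the spoon size (5 units), and the helper names `transfer` and `run` are assumptions made purely for illustration.

```python
from fractions import Fraction


def transfer(src, dst, amount):
    """Move `amount` of perfectly stirred liquid from glass `src` to glass `dst`."""
    total = src["red"] + src["white"]
    for kind in ("red", "white"):
        moved = amount * src[kind] / total
        src[kind] -= moved
        dst[kind] += moved


def run(rounds, volume=Fraction(100), spoon=Fraction(5)):
    """Repeat 'spoon of red glass into white glass, then spoon back' `rounds` times."""
    red_glass = {"red": volume, "white": Fraction(0)}
    white_glass = {"red": Fraction(0), "white": volume}
    for _ in range(rounds):
        transfer(red_glass, white_glass, spoon)
        transfer(white_glass, red_glass, spoon)
    return red_glass, white_glass


red_glass, white_glass = run(3)
# The white wine in the red glass equals the red wine in the white glass.
print(red_glass["white"] == white_glass["red"])
```

The equality holds for any number of rounds because each glass returns to its original level, which is exactly the conserved quantity described above.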

It is probably about time that we looked at the Noether theorem [1]. 

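Suppose that we have a Lagrangian density 

    \mathcal{L} = \mathcal{L}(\psi^A(x), \partial_\mu \psi^A(x))    (5.2)

and that we write 

    \frac{\partial \psi^A(x)}{\partial x^\mu} = \partial_\mu \psi^A(x) = \psi^A_{,\mu}(x).    (5.3)

Then the Lagrangian $L = \int \mathcal{L}\, d^3x$ is invariant under the changes of fields 

    \delta\psi^A(x)    (5.4)

and the current density 

    j^\mu(x) = \text{by definition} = \frac{\partial \mathcal{L}}{\partial \psi^A_{,\mu}}\, \delta\psi^A    (5.5)

has 

    \partial_\mu j^\mu = 0.    (5.6)

The Lie algebra $\mathcal{A}$ 

    [X_\alpha, X_\beta] = i f_{\alpha\beta}{}^{\gamma} X_\gamma    (5.7)

where the $f_{\alpha\beta}{}^{\gamma}$ are the constant structure constants, implies that 

    J^\mu_\beta(x) = (-i)\, \frac{\partial \mathcal{L}}{\partial \psi^A_{,\mu}(x)}\, [X_\beta]^A{}_B\, \psi^B(x).    (5.8)

Invariance of L under $\mathcal{A}$ tells us that 

    \partial_\mu J^\mu_\beta(x) = 0    (5.9)

or 

    \frac{d}{dt} \int J^0_\beta(x)\, d^3x = 0.    (5.10)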


Armed with this knowledge we now turn to the modern way of viewing 
forces. We start with the constant metric tensor $\eta^{\mu\nu}$, the components of which 
are given by 

    \eta^{00} = 1    (5.11)

    \eta^{ij} = -\delta^{ij}, \quad \delta^{ij} = 0 \text{ if } i \neq j \text{ but } 1 \text{ if } i = j    (5.12)

and all off diagonal elements are equal to zero. (The Kronecker delta, $\delta^{ij}$, may 
be familiar.) 

There are two very useful properties of $\delta$. Clearly $\delta^{ij} A_j = A^i$, which 
exhibits the raising of indices. The student is urged to check the lowering 
of indices also with $\delta_{ij}$. Again $\delta^i{}_i = 1 + 1 + 1 = 3$, which is the number of 
spatial indices. 

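It is convenient at this stage to introduce the properties of the totally 
antisymmetric Levi-Civita tensor $\epsilon_{ijk}$. By inspection 

    \epsilon_{ijk}\epsilon^{lmn} = \delta_i^l \delta_j^m \delta_k^n - \delta_i^l \delta_j^n \delta_k^m
                                 + \delta_i^m \delta_j^n \delta_k^l - \delta_i^n \delta_j^m \delta_k^l
                                 + \delta_i^n \delta_j^l \delta_k^m - \delta_i^m \delta_j^l \delta_k^n.    (5.13)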


Now do not throw up your hands in horror and despair. Let us pick ijk to 
be 123 and lmn also to be 123. (You must watch the order of indices because 
$\epsilon_{ijk}$ is antisymmetric.) We note that this checks the sign of the top term in the 
left-hand column, because $\epsilon_{123}\epsilon^{123} = 1 \times 1 = 1$ and $\delta_1^1 \delta_2^2 \delta_3^3 = 1 \times 1 \times 1 = 1$. 
Now look down the left-hand column and notice that ijk are fixed but lmn 
are cyclic, so that confirms the signs of the two lower entries. Again look 
at the top of the right-hand column and notice that ijk are again fixed but 
one switch of a pair of indices ($n \leftrightarrow m$) has been made, so that confirms the 
minus sign. Finally, look at the two lower members of the right-hand column 
to notice that ijk are again fixed but the lnm are now just cycled, confirming 
the signs. This is the way to remember the full original formula. Do not just 
trust your memory. Use the symmetries and antisymmetries. Now consider 


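$\epsilon_{ijk}\epsilon^{lmk}$, which is just the original expression we started from, but n has been 
picked equal to k and is therefore now summed, leaving us with a reduced 
tensor with only two upper and two lower indices. It is simple to work this 
out directly: 

    \epsilon_{ijk}\epsilon^{lmk} = \delta_i^l \delta_j^m \delta_k^k - \delta_i^l \delta_j^k \delta_k^m
                                 + \delta_i^m \delta_j^k \delta_k^l - \delta_i^k \delta_j^m \delta_k^l
                                 + \delta_i^k \delta_j^l \delta_k^m - \delta_i^m \delta_j^l \delta_k^k
                               = 3\delta_i^l \delta_j^m - \delta_i^l \delta_j^m
                                 + \delta_i^m \delta_j^l - \delta_i^l \delta_j^m
                                 + \delta_i^m \delta_j^l - 3\delta_i^m \delta_j^l
                               = \delta_i^l \delta_j^m - \delta_i^m \delta_j^l.    (5.14)

Again consider 

    \epsilon_{ijk}\epsilon^{ljk} = \delta_i^l \delta_j^j - \delta_i^j \delta_j^l
                                 = 3\delta_i^l - \delta_i^l
                                 = 2\delta_i^l.    (5.15)

Finally consider 

    \epsilon_{ijk}\epsilon^{ijk} = 2\delta_i^i = 3!.    (5.16)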


It always works to the pattern shown, and the final coefficient (here 3!) is 
always the factorial of the dimension of the case considered with previous 
ones going back up the list being the (dimension - 1)! and so forth as the tensor 
surviving has more indices. Keen students may try to work out the case for 
three spatial indices and one time index (the world we live in!) carefully 
watching the signs. 
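
The contracted identities are easy to confirm by brute force over all index values. The sketch below treats the three spatial indices as Euclidean, so upper and lower positions are not distinguished; the helper names `eps` and `delta` are illustrative.

```python
from itertools import product


def eps(i, j, k):
    """Levi-Civita symbol for indices taken from {1, 2, 3}."""
    return (i - j) * (j - k) * (k - i) // 2


def delta(a, b):
    """Kronecker delta."""
    return 1 if a == b else 0


R = range(1, 4)

# One index contracted: eps_ijk eps_lmk = d_il d_jm - d_im d_jl
ok_one_contraction = all(
    sum(eps(i, j, k) * eps(l, m, k) for k in R)
    == delta(i, l) * delta(j, m) - delta(i, m) * delta(j, l)
    for i, j, l, m in product(R, repeat=4)
)

# Two indices contracted: eps_ijk eps_ljk = 2 d_il
ok_two_contractions = all(
    sum(eps(i, j, k) * eps(l, j, k) for j in R for k in R) == 2 * delta(i, l)
    for i, l in product(R, repeat=2)
)

# All indices contracted: eps_ijk eps_ijk = 3!
full_contraction = sum(eps(i, j, k) ** 2 for i, j, k in product(R, repeat=3))
print(ok_one_contraction, ok_two_contractions, full_contraction)
```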

Armed with this mathematics we turn to studying electromagnetism and 
local gauge invariance. Suppose that charge conservation has been imposed 
on the coupling of a photon (the electromagnetic field potential) for a charged 
particle. In terms of the 4-potential 

    A^\mu(x) = (\Phi(x), \mathbf{A}(x)) = g^{\mu\nu} A_\nu(x)    (5.17)

where $\Phi(x)$ is the scalar potential and the $\mathbf{A}(x)$ are the three components of 
the vector potential. The field strengths are defined by 

    F^{\mu\nu} = \frac{\partial A^\nu}{\partial x_\mu} - \frac{\partial A^\mu}{\partial x_\nu}    (5.18)


and the electric field and magnetic induction in a noncovariant notation are 
given by 


E =(F", F®, F®) and B = (F”, F, F”), (5.19) 


respectively. You are urged to check this out for yourself, taking particular 
care with the signs. 

The free Dirac equation of mass m reads 

    (i\hbar \not\partial - mc)\psi = 0    (5.20)

where 

    \not\partial = \gamma^\mu \frac{\partial}{\partial x^\mu} = \gamma^\mu \partial_\mu,    (5.21)

$\hbar$ is Planck's constant divided by $2\pi$, and c is the velocity of light. (These 
constants will be dropped in what follows, bringing us to so-called natural 
units for the subject.) In momentum space with 

    p^\mu \to i\hbar \frac{\partial}{\partial x_\mu}    (5.22)

the Dirac equation takes the form 

    (\not p - mc)\psi = 0.    (5.23)


You may well have met the concept of "minimal" substitution where the 
interaction of electromagnetism is introduced by 


(v P t4 z me) y =0. (5.24) 


In many ways it is a pity that this choice of sign was made for the charge 
on the electron (the choice is arbitrary) as it leads to endless confusion in 
conduction of currents down wires. There is no way anyone can change this 
after so many years of history—certainly not this author. 

From a modern viewpoint the coupling given follows from local gauge 
invariance, or gauge invariance of the second kind as it is sometimes called. 
However, we will start from global gauge invariance or gauge invariance of 
the first kind. Here there is for U(1) a parameter $\theta$, which depends on neither 
space nor time. The electron transforms as 


    \psi \to \exp(-i\theta)\, \psi    (5.25)

and the electromagnetic four potential transforms as 

    A^\mu \to \exp(-i\theta)\, A^\mu \exp(i\theta),    (5.26)

so that, taking the derivative form of the coupled Dirac equation 

    (i\hbar \not\partial - \frac{e}{c} \not A - mc)\psi = 0,    (5.27)

we see that it is unchanged. Curiously, perhaps, there is already information 
to be found here. Since electric charge is conserved, and the space and 
time dependence of the fields is local, then creation and annihilation of charge 
must also simultaneously be local. However, this is not what we are looking 
for. Instead, let $\theta$ depend on both space and time, so that the invariances 
are now local. This time the derivative in the derivative form of the coupled 
Dirac equation (Equation 5.27) seems to spoil the symmetry when it acts on 
the parameter $\theta$. However, the local transformation of the four potential is 
then assumed to change and thus restore the symmetry. We take 


    \psi(x) \to e^{-i\theta(x)} \psi(x)    (5.28)

where we see that because of the x dependence of $\theta$ we must now look to 
restore a local symmetry. If we can find this, we would be looking at abelian 
gauge invariance, because there is no nontrivial group theory involved. We 
take a Lagrangian density for the Dirac electron field interacting with the 
electromagnetic current of the form 

    \mathcal{L} = \bar\psi (i \not\partial - m)\psi - \frac{1}{4} F_{\mu\nu} F^{\mu\nu} - e \bar\psi \not A \psi    (5.29)

in the natural units of the subject where $\hbar = 1 = c$. Here 

    F^{\mu\nu} = \partial^\mu A^\nu - \partial^\nu A^\mu    (5.30)

and we require the vector field to transform as 

    A_\mu \to A_\mu + \frac{1}{e} \partial_\mu \theta    (5.31)

so that $F^{\mu\nu}$ is invariant and then define a covariant derivative by 

    D_\mu \psi = (\partial_\mu + i e A_\mu)\psi,    (5.32)

so that we see that $D_\mu \psi$ transforms in the same way as $\psi$ 

    D_\mu \psi \to e^{-i\theta(x)} D_\mu \psi    (5.33)

and our Lagrange density is invariant with the conventional factor of 1/4 as 
stated. This describes a massless vector field (experimentally the photon), 
because the mass term proportional to $A_\mu A^\mu$ is not invariant. 
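
The way the shift of the potential repairs the symmetry can be seen numerically in one dimension. This is a sketch under stated assumptions: it takes the transformation laws in the forms $\psi \to e^{-i\theta}\psi$, $A \to A + \theta'/e$ and $D = d/dx + ieA$, and the test functions `psi`, `theta`, `A` and the value of the coupling `E` are arbitrary illustrative choices.

```python
import cmath
import math

E = 0.3  # coupling e; arbitrary illustrative value


def psi(x):
    return (1.0 + 0.3 * x) * cmath.exp(0.7j * x)


def theta(x):
    return math.sin(x) + 0.5 * x * x


def A(x):
    return math.cos(2.0 * x)


def deriv(f, x, h=1e-5):
    """Central finite-difference derivative."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def cov_deriv(f, a, x):
    """D f = (d/dx + i e a) f, a one-dimensional analogue of the covariant derivative."""
    return deriv(f, x) + 1j * E * a(x) * f(x)


def psi_t(x):
    """Locally transformed field."""
    return cmath.exp(-1j * theta(x)) * psi(x)


def A_t(x):
    """Transformed potential, shifted by the gradient of theta."""
    return A(x) + deriv(theta, x) / E


x0 = 0.4
target = cmath.exp(-1j * theta(x0)) * cov_deriv(psi, A, x0)
with_shift = abs(cov_deriv(psi_t, A_t, x0) - target)     # potential transformed
without_shift = abs(cov_deriv(psi_t, A, x0) - target)    # potential left alone
print(with_shift, without_shift)
```

With the shifted potential the mismatch is at the level of the finite-difference error, whereas leaving the potential untouched leaves a term proportional to the derivative of $\theta$.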

We now move on to consider nonabelian gauge field theories. The extension 
required is to write 

    \psi(x) \to e^{-i T_i \theta_i(x)} \psi(x)    (5.34)

where the $T_i$ are square matrices satisfying the commutation relations of some 
nonabelian Lie algebra 

    [T_i, T_j] = T_i T_j - T_j T_i = i f_{ijk} T_k    (5.35)

where the $f_{ijk}$ are the totally antisymmetric structure constants of the Lie 
group. For SU(2) these are simply the $\epsilon_{ijk}$ Levi-Civita tensor components. By 
analogy with the previous case we introduce a covariant derivative by 

    D_\mu \psi = (\partial_\mu + i g\, T \cdot A_\mu)\psi    (5.36)


where g is the coupling constant (rather like e) and $\theta_i(x)$ has been extended 
to have the same number of components as the adjoint (or regular) 
representation of the Lie group, for example, 3 for SU(2). Thus, there is now the 
corresponding number of gauge fields, and we adopt the gauge 
transformation property 

    A_{i\mu} \to A_{i\mu} + \frac{1}{g} \partial_\mu \theta_i + f_{ijk} \theta_j A_{k\mu}    (5.37)


to allow the gauge fields to cancel out the unwanted terms. To construct a 
gauge invariant Lagrangian for the gauge fields themselves we define 


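    F^{\mu\nu} = F_i^{\mu\nu} T_i = -i g^{-1} [D^\mu, D^\nu]    (5.38)

so that 

    F^{\mu\nu} = \partial^\mu A^\nu - \partial^\nu A^\mu + i g [A^\mu, A^\nu]    (5.39)

where a total derivative has been dropped. 
Equivalently we have 

    F_i^{\mu\nu} = \partial^\mu A_i^\nu - \partial^\nu A_i^\mu - g f_{ijk} A_j^\mu A_k^\nu    (5.40)

and this is independent of the fermion representation. 
The transformation of $F^{\mu\nu}$ under the gauge group is 

    F^{\mu\nu}(x) \to U(x) F^{\mu\nu}(x) U^{-1}(x)    (5.41)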


so that a gauge invariant Lagrangian $\mathcal{L}_{YM}$ for the gauge (now Yang-Mills) [2] 
fields is 

    \mathcal{L}_{YM} = -\frac{1}{2} \mathrm{Tr}(F_{\mu\nu} F^{\mu\nu}) = -\frac{1}{4} F_{i\mu\nu} F_i^{\mu\nu}    (5.42)

since the normalization of the generators of the gauge group is conventionally 
given by 

    \mathrm{Tr}(T_i T_j) = \frac{1}{2} \delta_{ij}.    (5.43)

We can see that the gauge invariant Lagrangian for a Dirac spinor field 
interacting with vector gauge fields is 

    \mathcal{L} = \bar\psi (i \not D - m)\psi - \frac{1}{2} \mathrm{Tr}(F_{\mu\nu} F^{\mu\nu})
                = \bar\psi (i \not D - m)\psi - \frac{1}{4} F_{i\mu\nu} F_i^{\mu\nu}.    (5.44)
If the gauge group is a simple Lie group, then there is a single coupling 
constant g. But, if the gauge group is semisimple, one which can be written 
as a product of simple factors, then there will be separate coupling constants 
for each factor. 
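For our case, the Euler-Lagrange equations are 

    \partial_\mu \frac{\partial \mathcal{L}}{\partial(\partial_\mu A_{i\nu})} = \frac{\partial \mathcal{L}}{\partial A_{i\nu}}    (5.45)

    \partial_\mu \frac{\partial \mathcal{L}}{\partial(\partial_\mu \psi)} = \frac{\partial \mathcal{L}}{\partial \psi}.    (5.46)

The first of these leads to 

    \partial^\mu F_{i\mu\nu} - g f_{ijk} A_j^\mu F_{k\mu\nu} = g \bar\psi \gamma_\nu T_i \psi,    (5.47)

which can be rewritten as 

    D^\mu A_i^\nu = \partial^\mu A_i^\nu - g f_{ijk} A_j^\mu A_k^\nu    (5.48)

in terms of the covariant derivative of the gauge field. The other one gives 

    (i \not D - m)\psi = 0    (5.49)

with 

    D^\mu \psi = (\partial^\mu + i g\, T \cdot A^\mu(x))\psi    (5.50)

as previously. 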

Because we need SU(3) for a treatment of quantum chromodynamics in 
the standard model, we shall simply list the main features here. Because the 
colors of the quarks, which we shall take as red, blue, and green, are three in 
number, we shall need an extension of the 2 x 2 Pauli matrices to 3 x 3 matrices 
and correspondingly the adjoint or regular representation will be 8 x 8. We 
start by defining 


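    \lambda_1 = \begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad
    \lambda_2 = \begin{pmatrix} 0 & -i & 0 \\ i & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad
    \lambda_3 = \begin{pmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 0 \end{pmatrix}    (5.51)

exactly as with Pauli matrices but only in the top left-hand corner. The 
commutators of pairs of these 

    [\lambda_i, \lambda_j] = 2i f_{ijk} \lambda_k    (5.52)

reveal that the structure constants in this sector are simply the $\epsilon_{ijk}$ of 
Levi-Civita. There is a fourth matrix, usually designated $\lambda_8$, which commutes to 
zero with the first three, and with the standard normalization given by 

    \mathrm{Tr}(\lambda_8 \lambda_8) = 2    (5.53)

is taken to be 

    \lambda_8 = \frac{1}{\sqrt{3}} \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -2 \end{pmatrix}.    (5.54)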


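Then $\lambda_4$ and $\lambda_5$, copying the first two Pauli matrices but in the first and 
third rows and columns, are taken as 

    \lambda_4 = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{pmatrix} \quad \text{and} \quad
    \lambda_5 = \begin{pmatrix} 0 & 0 & -i \\ 0 & 0 & 0 \\ i & 0 & 0 \end{pmatrix}.    (5.55)

Similarly, $\lambda_6$ and $\lambda_7$, in the second and third rows and columns, become 

    \lambda_6 = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix} \quad \text{and} \quad
    \lambda_7 = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & -i \\ 0 & i & 0 \end{pmatrix}    (5.56)

making $8 = 3^2 - 1$ in total. 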
The totally antisymmetric structure constants $f_{ijk}$ are 

    ijk | f_ijk
    123 | 1
    147 | 1/2
    156 | -1/2
    246 | 1/2
    257 | 1/2
    345 | 1/2
    367 | -1/2
    458 | \sqrt{3}/2
    678 | \sqrt{3}/2

with all nonindicated ones vanishing. However, there are now some totally 
symmetric constants denoted by $d_{ijk}$, which are 

    ijk | d_ijk
    118 | 1/\sqrt{3}
    228 | 1/\sqrt{3}
    338 | 1/\sqrt{3}
    146 | 1/2
    157 | 1/2
    247 | -1/2
    256 | 1/2
    344 | 1/2
    355 | 1/2
    366 | -1/2
    377 | -1/2
    448 | -1/(2\sqrt{3})
    558 | -1/(2\sqrt{3})
    668 | -1/(2\sqrt{3})
    778 | -1/(2\sqrt{3})
    888 | -1/\sqrt{3}


with all nonindicated ones vanishing. You can see that there are eight colors of 
massless vector bosons, called gluons in this case. They couple only to quarks, 
which carry three colors (normally called red, blue, and green), and to 
themselves in the now familiar manner. These are the strong interactions and they are 
believed to be confining so that there is no free color. The general idea is 
that if we try to separate colored objects then the forces between them become so 
strong that new pairs of equal and opposite color charges are created and the 
free color remains hidden. No exact solution to this nonlinear field system is known. 
The usual attack is to move to a four-dimensional space (time becomes the 
fourth space dimension), then to introduce a lattice structure and to attempt to 
find approximate solutions using very extensive national computer systems. 
The coupling constant g is found to change with energy scale. There is no 
known way to unify this strong force with the weak and electromagnetic 
forces. 
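
The structure constants can be recomputed rather than memorized. The sketch below assumes the standard Gell-Mann form of the eight 3 x 3 matrices and uses the trace relations $f_{ijk} = \mathrm{Tr}([\lambda_i, \lambda_j]\lambda_k)/4i$ and $d_{ijk} = \mathrm{Tr}(\{\lambda_i, \lambda_j\}\lambda_k)/4$; the helper names are illustrative.

```python
import math


def mat(rows):
    return [[complex(v) for v in row] for row in rows]


def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def trace(a):
    return sum(a[i][i] for i in range(3))


S = 1 / math.sqrt(3)
# Assumed standard Gell-Mann matrices, indexed 1..8.
LAM = {
    1: mat([[0, 1, 0], [1, 0, 0], [0, 0, 0]]),
    2: mat([[0, -1j, 0], [1j, 0, 0], [0, 0, 0]]),
    3: mat([[1, 0, 0], [0, -1, 0], [0, 0, 0]]),
    4: mat([[0, 0, 1], [0, 0, 0], [1, 0, 0]]),
    5: mat([[0, 0, -1j], [0, 0, 0], [1j, 0, 0]]),
    6: mat([[0, 0, 0], [0, 0, 1], [0, 1, 0]]),
    7: mat([[0, 0, 0], [0, 0, -1j], [0, 1j, 0]]),
    8: mat([[S, 0, 0], [0, S, 0], [0, 0, -2 * S]]),
}


def f(i, j, k):
    """Antisymmetric structure constant from Tr([l_i, l_j] l_k) / 4i."""
    a, b, c = LAM[i], LAM[j], LAM[k]
    comm = trace(mul(mul(a, b), c)) - trace(mul(mul(b, a), c))
    return (comm / 4j).real


def d(i, j, k):
    """Symmetric constant from Tr({l_i, l_j} l_k) / 4."""
    a, b, c = LAM[i], LAM[j], LAM[k]
    anti = trace(mul(mul(a, b), c)) + trace(mul(mul(b, a), c))
    return (anti / 4).real


print(f(1, 2, 3), f(1, 4, 7), f(4, 5, 8), d(8, 8, 8))
```

Looping `f` and `d` over all index triples reproduces the nonvanishing entries and shows that everything else is zero up to rounding.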

Problems 


5.1 In the red and white wine puzzle, do the algebra in the easy case. 
5.2 In the red and white wine puzzle, do the algebra in the harder case. 


5.3 Without looking back at the text, write down $\epsilon_{ijk}\epsilon^{lmn}$ and then check 
your result. 


5.4 Again without looking back at the text, find the contracted forms 
of the product of $\epsilon_{ijk}\epsilon^{lmn}$ and then check your results. 


5.5 Check Equation (5.19) making sure that you get the signs correct. 


5.6 Read again from Equations (5.28) to (5.33). Then repeat this calcu- 
lation until you get it correct. 


5.7 Read again the section leading to the covariant derivative in terms 
of the derivative of the four potential and the structure constants 
$f_{ijk}$ in Equation (5.48). Repeat without looking back at the text until 
you can confidently perform this construction. 


5.8 Write down the Pauli matrices and the unit 2 x 2 matrix. Work out 
all their products. 


5.9 Extend the work in Problem 5.8 to the 3 x 3 case, making sure that 
you understand the logic. 


5.10 Calculate the $f_{ijk}$ structure constants for the 3 x 3 case and check 
your results against the text. 


6 


Lie Group Techniques for the Standard 
Model Lie Groups 


The student who is happy to build a strong background in these topics is 
advised to consult H. Georgi [1]. Those who prefer a detailed mathematical 
derivation are advised to consult J.F. Cornwell [2]. The techniques for finding 
the explicit forms of the characters of the tensor irreducible representations 
of the unitary groups, symplectic groups, and both sets of orthogonal groups 
can be found in N.E.P. Samara and R.C. King [3]. 

The case of unitary groups has already been treated. I now present the cases 
of the symplectic groups and the orthogonal groups, which are extracted from 
Samara and King [4] in a manner designed to show the N dependence in each 
case, as this is most frequently the information really required. 

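For SO(N) in terms of the partition labels $\lambda$ we find 

    D_N[\lambda] = \frac{1}{H(\lambda)} \prod_{(i,j) \in \lambda,\ i \ge j} (N + \lambda_i + \lambda_j - i - j)
                   \times \prod_{(i,j) \in \lambda,\ i < j} (N - \tilde\lambda_i - \tilde\lambda_j + i + j - 2)    (6.1)

and for SP(N), in a similar way, we find 

    D_N\langle\lambda\rangle = \frac{1}{H(\lambda)} \prod_{(i,j) \in \lambda,\ i > j} (N + \lambda_i + \lambda_j - i - j + 2)
                   \times \prod_{(i,j) \in \lambda,\ i \le j} (N - \tilde\lambda_i - \tilde\lambda_j + i + j)    (6.2)

where the products run over the boxes (i, j) of the Young diagram of $\lambda$, $\tilde\lambda$ is 
the conjugate partition, and $H(\lambda)$ is the product of the hook lengths. 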
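
These product formulas lend themselves to a short program. The sketch below assumes the box-by-box form of the Samara and King result, in which each box (i, j) of the Young diagram contributes one factor divided by its hook length; the function names and the group labels "SO" and "Sp" are illustrative choices.

```python
from fractions import Fraction


def conjugate(lam):
    """Conjugate partition: the column lengths of the Young diagram."""
    return [sum(1 for part in lam if part > c) for c in range(lam[0])] if lam else []


def part(p, i):
    """i-th part (1-based) of partition p, zero beyond its length."""
    return p[i - 1] if i <= len(p) else 0


def dimension(lam, n, group):
    """Dimension of the tensor irrep labelled by partition `lam` for group 'SO' or 'Sp'."""
    if group not in ("SO", "Sp"):
        raise ValueError("group must be 'SO' or 'Sp'")
    conj = conjugate(lam)
    result = Fraction(1)
    for i in range(1, len(lam) + 1):
        for j in range(1, lam[i - 1] + 1):
            if group == "SO":
                if i >= j:
                    factor = n + part(lam, i) + part(lam, j) - i - j
                else:
                    factor = n - part(conj, i) - part(conj, j) + i + j - 2
            else:
                if i > j:
                    factor = n + part(lam, i) + part(lam, j) - i - j + 2
                else:
                    factor = n - part(conj, i) - part(conj, j) + i + j
            hook = (lam[i - 1] - j) + (conj[j - 1] - i) + 1
            result *= Fraction(factor, hook)
    return int(result)


print(dimension([1], 10, "SO"), dimension([1, 1], 10, "SO"), dimension([2], 4, "Sp"))
```

As a sanity check, the vector of SO(10) has dimension 10, the antisymmetric tensor 45, and the symmetric tensor of Sp(4) is the 10-dimensional adjoint.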


Roots and Weights 


This will probably be familiar to the reader from quantum mechanics in 
operator form. We want to find a complete commuting set of observables. In this 
case we write them as a Cartan subalgebra of Hermitian operators: 

    H_i = H_i^\dagger    (6.3)
    [H_i, H_j] = 0.    (6.4)

We can normalize by 

    \mathrm{Tr}(H_i H_j) = k_D \delta_{ij} \quad \text{for } i, j = 1 \text{ to } m,    (6.5)

where $k_D$ depends on the representation and the normalization, and m is 
called the rank of the algebra. 
The states of a representation D can be designated by 

    H_i |\mu, x, D\rangle = \mu_i |\mu, x, D\rangle    (6.6)

after diagonalization. The $\mu_i$ are called weights and are real. The m-component 
vector made of the $\mu_i$ is the weight vector. Any other label that is needed to 
specify the state is denoted by x. 

The adjoint representation is particularly important. It has the rows and columns 
of the matrices labeled by the same index that labels the generators. We can denote 
the state in the adjoint representation corresponding to an arbitrary operator 
$V_i$ by $|V_i\rangle$. The scalar product is taken as 

    \langle V_i | V_j \rangle = (k_D)^{-1} \mathrm{Tr}(V_i^\dagger V_j),    (6.7)

where the dagger is included to allow for complex linear combinations of 
generators, such as the raising and lowering operators of SU(2) in quantum 
mechanics. The action of a generator on a state can be calculated as 

    V_i |V_j\rangle = |V_k\rangle \langle V_k | V_i | V_j \rangle
                    = |V_k\rangle [T_i]_{kj}
                    = -i f_{ikj} |V_k\rangle
                    = i f_{ijk} |V_k\rangle,    (6.8)

which is, of course, just the state corresponding to the commutator of $V_i$ and $V_j$. 

The roots are the weights of the adjoint representation. Because the Cartan 
generators commute, the corresponding states have zero weight vectors. All 
states in the adjoint representation with zero weight vectors correspond to 
Cartan generators. The other states in the adjoint representation have nonzero 
weight vectors $\alpha$ with components $\alpha_i$, so that 

    H_i |E_\alpha\rangle = \alpha_i |E_\alpha\rangle,    (6.9)

which implies that 

    [H_i, E_\alpha] = \alpha_i E_\alpha.    (6.10)

Notice that the $E_\alpha$ are not Hermitian, just like raising and lowering operators 
in SU(2). The normalization of states in the adjoint representation is taken to 
ensure 

    \langle E_\alpha | E_\beta \rangle = (k_D)^{-1} \mathrm{Tr}(E_\alpha^\dagger E_\beta) = \delta_{\alpha\beta}.    (6.11)

The weights $\alpha_i$ are called roots and the special weight vector $\alpha$ with 
components $\alpha_i$ is a root vector. 
You may enjoy showing that 


    [E_\alpha, E_{-\alpha}] = \alpha \cdot H,    (6.12)

which should remind you of the SU(2) commutation relation $[J^+, J^-] = J_3$. 
This analogy will be exploited to learn more about the representations of 
compact Lie groups. Generally, for any weight $\mu$ of a representation D, the $E_3$ 
value is given by 

    E_3 |\mu, x, D\rangle = \frac{\alpha \cdot \mu}{\alpha^2} |\mu, x, D\rangle.    (6.13)

Because the $E_3$ values must be integers or half integers, 

    \frac{2\alpha \cdot \mu}{\alpha^2} = \text{integer}.    (6.14)

It is simple to show that 

    \frac{\alpha \cdot \mu}{\alpha^2} = -\frac{1}{2}(p - q),    (6.15)

where $\mu + p\alpha$ is the weight of the highest $E_3$ state of the SU(2) spin j 
representation and q plays the corresponding role when lowering. 

We can easily make use of these results. Apply Equation (6.13) to two 
distinct roots $\alpha$ and $\beta$ and we find (using $E_\alpha$ as the SU(2) definition) 

    \frac{\alpha \cdot \beta}{\alpha^2} = -\frac{1}{2}(p - q).    (6.16)

Again, now using $E_\beta$ yields instead 

    \frac{\beta \cdot \alpha}{\beta^2} = -\frac{1}{2}(p' - q').    (6.17)


If $\theta_{\alpha\beta}$ is the angle between the roots $\alpha$ and $\beta$ then multiplying the last two 
results gives 

    \cos^2 \theta_{\alpha\beta} = \frac{(\alpha \cdot \beta)^2}{\alpha^2 \beta^2} = \frac{(p - q)(p' - q')}{4}.    (6.18)

Now this is important. Notice that $(p - q)(p' - q')$ must be an integer that 
is nonnegative so that there exist just four possibilities (up to complements) 
for the angles between the roots. 

We can list these possibilities: 

    (p - q)(p' - q') | \theta_{\alpha\beta}
    0                | 90°
    1                | 60° or 120°
    2                | 45° or 135°
    3                | 30° or 150°

You may think that we have missed two other possibilities. But $(p - q)(p' - q') = 4$ 
corresponds to 0° or 180°. Neither is of any use. The first is in violation 
of uniqueness. The second is trivial because roots come in pairs of opposite 
signs, both in the same SU(2) group. 

We have met SU(3) before, but it will be convenient to learn a little more. 
What are the weights and the roots? Well $T_3$ and $T_8$ are already diagonal 
and normalized in the standard way. The eigenvectors and associated 
weights are 

    (1, 0, 0)^T \to (1/2, \sqrt{3}/6), \quad (0, 1, 0)^T \to (-1/2, \sqrt{3}/6), \quad (0, 0, 1)^T \to (0, -\sqrt{3}/3).    (6.19)

These vectors, plotted in a plane, form the vertices of an equilateral triangle 
(Figure 6.1). 

FIGURE 6.1 
The conjugate triplet multiplet. (The plotted weights are (-1/2, \sqrt{3}/6), (1/2, \sqrt{3}/6), and (0, -\sqrt{3}/3).) 


FIGURE 6.2 
The octet multiplet and the singlet multiplet. 


The roots are differences of weights, because the corresponding generators 
take one weight to another. The generators clearly have only one off-diagonal 
entry each and can be written as 

    \frac{1}{\sqrt{2}}(T_1 \pm i T_2) = E_{\pm 1, 0}
    \frac{1}{\sqrt{2}}(T_4 \pm i T_5) = E_{\pm 1/2, \pm \sqrt{3}/2}    (6.20)
    \frac{1}{\sqrt{2}}(T_6 \pm i T_7) = E_{\mp 1/2, \pm \sqrt{3}/2}

where the plus and minus signs are correlated. The roots form a regular 
hexagon, plotted along with the two elements of the Cartan subalgebra in the 
center (Figure 6.2). 
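
A few lines of code make the triangle and the hexagon explicit. The sketch assumes the generators are normalized as $T_a = \lambda_a/2$ with the standard diagonal $\lambda_3$ and $\lambda_8$, so that the weights can be read off the diagonals; the variable names are illustrative.

```python
import math
from itertools import permutations

R3 = math.sqrt(3)

# Diagonal entries of T3 = lambda_3 / 2 and T8 = lambda_8 / 2 (assumed normalization).
T3 = (0.5, -0.5, 0.0)
T8 = (R3 / 6, R3 / 6, -R3 / 3)

# Weight of each basis state of the defining representation.
weights = [(T3[a], T8[a]) for a in range(3)]

# Nonzero roots are differences of distinct weights.
roots = [(w[0] - v[0], w[1] - v[1]) for w, v in permutations(weights, 2)]


def angle_deg(u, v):
    """Angle between two plane vectors in degrees."""
    cos = (u[0] * v[0] + u[1] * v[1]) / (math.hypot(*u) * math.hypot(*v))
    return math.degrees(math.acos(max(-1.0, min(1.0, cos))))


lengths = [math.hypot(*r) for r in roots]
angles = sorted(round(angle_deg(roots[0], r)) for r in roots[1:])
print(roots)
print(lengths, angles)
```

All six roots come out with unit length and are separated by multiples of 60 degrees, which is the regular hexagon described in the text.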


Simple Roots 


To complete the analogy between SU(2) and an arbitrary simple Lie algebra 
we need an idea of positivity for the weights. We can then treat raising and 
lowering operators and the highest weight. We require that every nonzero weight 
be either positive or negative, and that if $\mu$ is positive then $-\mu$ is negative. 

In an arbitrary basis for the Cartan subalgebra the components ($\mu_1$, $\mu_2$, etc.) 
of the weight are fixed. We decide that the weight is positive if its first 
nonzero component is positive and negative if its first nonzero component is 
negative. (It actually does not matter what the basis is, but it feels better.) 

In SU(3), the three-dimensional defining representation then has a negative 
weight in the upper left-hand part, a positive weight in the upper right-hand 
part, and again, has a negative weight in the lower half. We define an ordering 
by $\mu > \nu$ if $(\mu - \nu)$ is positive and can now think of the highest weight 
in a representation. You may enjoy working out the SU(3) adjoint weights 
yourself. 


    O≡O    (three lines) if the angle is 150° 
    O=O    (two lines) if the angle is 135° 
    O-O    (one line) if the angle is 120° 
    O O    (no line) if the angle is 90° 


FIGURE 6.3 
The four angles between the pairs of simple roots. 


We define simple roots as positive roots that cannot be written as a sum of other 
positive roots. If a weight is annihilated by the generators of all the simple 
roots, then it is the highest weight of an irreducible representation. Now, from 
the geometry of the simple roots you can reconstruct the whole algebra. I advise 
students to do this on their own. Using Equation 6.13 and noting that if $\alpha$ and 
$\beta$ are different simple roots, then $\alpha - \beta$ is not a root, and thus $|E_\alpha\rangle$ is 
annihilated by $E_{-\beta}$ and $|E_\beta\rangle$ is annihilated by $E_{-\alpha}$, you can show that 

    \frac{2\alpha \cdot \beta}{\alpha^2} = -p    (6.21)
    \frac{2\beta \cdot \alpha}{\beta^2} = -p'.    (6.22)

If you know the integers p and p' for each simple root then you know the 
angles between the simple roots and their relative lengths. Indeed 

    \cos \theta_{\alpha\beta} = -\frac{\sqrt{p p'}}{2}    (6.23)

and the angle between any pair of simple roots satisfies 

    \frac{\pi}{2} \le \theta < \pi    (6.24)


where the first inequality follows from Equation 6.23 because the cosine is 
less than or equal to zero and the second inequality then follows because all 
the roots are positive. 

A Dynkin diagram is a shorthand notation for writing the simple roots. 
Each simple root is shown as an open circle. Pairs of circles are connected by 
lines, depending on the angle between the pair of roots to which the circles 
correspond. The scheme is shown in Figure 6.3. 

The Dynkin diagram determines all the angles between pairs of simple 
roots. There may, however, be choices for the relative lengths. We note that 
Figure 6.4 is the diagram for SU(2) and that Figure 6.5 is the diagram for SU(3). 


    O    is the diagram for SU(2) 


FIGURE 6.4 
The SU(2) multiplet. 


    O-O    is the diagram for SU(3) 


FIGURE 6.5 
The SU(3) multiplet. 


The Cartan Matrix 


You do not need to use dreadful geometrical calculations to keep track of the 
integers $p^i$ and $q^i$ associated with the action of a simple root $\alpha^i$ on a state 
$|\phi\rangle$ for a positive root $\phi$. The idea is to label the roots directly by their $q^i - p^i$ 
values. The $q^i - p^i$ of any weight, $\mu$, is simply twice its $E_3$ value, where $E_3$ is 
the Cartan generator of the SU(2) associated with the simple root $\alpha^i$. Because 

    2E_3 |\mu\rangle = \frac{2 H \cdot \alpha^i}{\alpha^{i2}} |\mu\rangle = (q^i - p^i) |\mu\rangle,    (6.25)

if A is the Cartan matrix then using Equation (6.25) and the form of the 
Cartan matrix 

    A_{ji} = \frac{2\alpha^j \cdot \alpha^i}{\alpha^{i2}}    (6.26)

will give us all the same information as the Dynkin diagram. For SU(3), the 
Cartan matrix has the form 

    A = \begin{pmatrix} 2 & -1 \\ -1 & 2 \end{pmatrix}.    (6.27)
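
As a quick illustration, the Cartan matrix can be computed directly from a set of simple roots. The sketch assumes the convention $A_{ji} = 2\alpha^j \cdot \alpha^i / \alpha^{i2}$ and, for SU(3), the simple roots $(1/2, \sqrt{3}/2)$ and $(1/2, -\sqrt{3}/2)$; the second example uses a common choice of simple roots for the rank 2 algebra $B_2$ and is included only for comparison.

```python
import math


def cartan_matrix(simple_roots):
    """A[j][i] = 2 (alpha^j . alpha^i) / (alpha^i . alpha^i), rounded to integers."""
    def dot(u, v):
        return sum(a * b for a, b in zip(u, v))

    return [[round(2 * dot(aj, ai) / dot(ai, ai)) for ai in simple_roots]
            for aj in simple_roots]


R3 = math.sqrt(3)
su3 = cartan_matrix([(0.5, R3 / 2), (0.5, -R3 / 2)])

# Assumed simple roots for B2: one long root and one short root at 135 degrees.
b2 = cartan_matrix([(1.0, -1.0), (0.0, 1.0)])
print(su3, b2)
```

The unequal off-diagonal entries in the second example reflect the two different root lengths, information that the angles alone do not fix.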


Finding All the Roots 


We can use the Cartan matrix to simplify calculating all of the roots from the 
simple roots. 
The action of the raising operator $E_{\alpha^j}$ moves $\phi$ to $\phi + \alpha^j$. Writing 
$\phi = \sum_j k_j \alpha^j$, this just changes $k_j$ to $k_j + 1$ and thus $q^i - p^i$ to 
$q^i - p^i + A_{ji}$, that is, 

    k_j \to k_j + 1,
    q^i - p^i \to q^i - p^i + A_{ji}.    (6.28)

If we think of the $q^i - p^i$ as the elements of a row vector, this is equivalent to 
simply adding the jth row of the Cartan matrix, which is simply the vector 
$q - p$ associated with the simple root $\alpha^j$. This speeds up the calculations of 
the roots. We will do it for SU(3). 

Start with the simple roots in the $q - p$ notation. Put each in a rectangular 
box and arrange them on a horizontal line representing the $k = 1$ layer of 
positive roots, that is, the simple roots. 

    k = 1    [2 -1]  [-1 2]    \alpha^1, \alpha^2    (6.29)

Now put a box with m zeros, representing the Cartan generators, on a line 
below, representing the $k = 0$ layer. 

    k = 0    [0 0]    H_i    (6.30)

Now for each element of each box we know the $q^i$ value. For the ith element 
of $\alpha^i$, $q^i = 2$ because the root is part of the SU(2) spin 1 representation, 
consisting of $E_{\pm\alpha^i}$ and $\alpha^i \cdot H$. For all the other elements, $q^j = 0$ because 
$\alpha^i - \alpha^j$ is not a root. 
Thus 

    q =       2  0     0  2
    k = 1    [2 -1]  [-1 2]    \alpha^1, \alpha^2
    k = 0        [0 0]         H_i    (6.31)

We can compute the corresponding $p^i$. 

    p =       0  1     1  0
    k = 1    [2 -1]  [-1 2]    \alpha^1, \alpha^2
    k = 0        [0 0]         H_i    (6.32)


Since the ith element of $\alpha^i$ is 2, the corresponding $p^i$ is zero. For all the 
others, p is just minus the entry. For each nonzero p, we draw a line from the 
simple root to a new root with $k = 2$ on a horizontal line above the $k = 1$ 
line, obtained by adding the appropriate simple root. You can also draw such 
lines from the $k = 0$ layer to the $k = 1$ layer and the lines for each root will 
have a different angle. Then try to put the boxes on the $k = 2$ layer so that the 
lines associated with each root have the same angle they did between the 0 
and 1 layer. These lines represent the action of the SU(2) raising and lowering 
operators. 

    p =           0 0
    q =           1 1
    k = 2        [1 1]         \alpha^1 + \alpha^2
    k = 1    [2 -1]  [-1 2]    \alpha^1, \alpha^2
    k = 0        [0 0]         H_i    (6.33)

This is now trivial to iterate, for everything you need to go from $k = l$ to 
$k = l + 1$ is on the diagram. For SU(3), the procedure terminates at $k = 2$ 
because all the p's are zero. 

    k = 2         [1 1]          \alpha^1 + \alpha^2
                  /   \
    k = 1     [2 -1]  [-1 2]     \alpha^1, \alpha^2
                  \   /
    k = 0         [0 0]          H_i
                  /   \
    k = -1    [1 -2]  [-2 1]     -\alpha^2, -\alpha^1
                  \   /
    k = -2       [-1 -1]         -\alpha^1 - \alpha^2    (6.34)
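
The layer-by-layer procedure can be automated. The following sketch represents each positive root by its integer coefficients $k_j$ in terms of the simple roots, computes the $q - p$ labels from the rows of the Cartan matrix (convention $A_{ji}$ as above), finds q by counting how often a simple root can be subtracted, and adds a simple root whenever the resulting p is positive. The function name `positive_roots` is an illustrative choice.

```python
def positive_roots(cartan):
    """Return the positive roots layer by layer, each as a tuple of coefficients k_j."""
    m = len(cartan)
    simple = [tuple(1 if a == b else 0 for b in range(m)) for a in range(m)]
    found = set(simple)
    layer = list(simple)
    layers = [layer]
    while layer:
        nxt = []
        for k in layer:
            # q - p labels: sum over j of k_j times row j of the Cartan matrix.
            label = [sum(k[j] * cartan[j][i] for j in range(m)) for i in range(m)]
            for i in range(m):
                if k == simple[i]:
                    # alpha^i lies in the SU(2) triplet with alpha^i.H and -alpha^i.
                    q = 2
                else:
                    q = 0
                    below = list(k)
                    below[i] -= 1
                    while tuple(below) in found:
                        q += 1
                        below[i] -= 1
                p = q - label[i]
                if p > 0:
                    up = tuple(k[j] + (1 if j == i else 0) for j in range(m))
                    if up not in found:
                        found.add(up)
                        nxt.append(up)
        layer = nxt
        if layer:
            layers.append(layer)
    return layers


su3_layers = positive_roots([[2, -1], [-1, 2]])
b2_layers = positive_roots([[2, -2], [-1, 2]])
print(su3_layers)
print(b2_layers)
```

For SU(3) the iteration stops at the second layer with the single root $\alpha^1 + \alpha^2$; for the $B_2$ Cartan matrix it continues one layer further.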


Fundamental Weights 


Suppose that the simple roots of some Lie algebra are $\alpha^j$ for $j = 1$ to m. 
The highest weight, $\mu$, of an arbitrary irreducible representation, D, has the 
property that $\mu + \phi$ is not a weight for any positive root $\phi$. For this it is 
clearly sufficient that $\mu + \alpha^j$ not be a weight in the representation for any j, 
because then 

    E_{\alpha^j} |\mu\rangle = 0 \text{ for all } j,    (6.35)

which implies that all positive roots annihilate the state. This is clearly an 
if-and-only-if statement. Thus, $\mu$ is the highest weight of an irreducible 
representation. Hence, for every $E_{\alpha^j}$ acting on $|\mu\rangle$, $p = 0$ and thus 

    \frac{2\alpha^j \cdot \mu}{\alpha^{j2}} = \ell^j    (6.36)

where the $\ell^j$ are nonnegative integers. The $\ell^j$ completely determine $\mu$. Every 
set of $\ell^j$ gives a $\mu$ that is the highest weight of some irreducible 
representation. Hence the irreducible representations of rank m simple Lie algebras 
can be labeled by a set of m nonnegative integers $\ell^j$. These integers are called 
the Dynkin coefficients. 


Consider the weight vectors, $\mu^j$, satisfying 

    \frac{2\alpha^j \cdot \mu^k}{\alpha^{j2}} = \delta_{jk}.    (6.37)

Every highest weight can be written uniquely as 

    \mu = \sum_{j=1}^{m} \ell^j \mu^j.    (6.38)

The vectors $\mu^j$ are called the fundamental weights and the m irreducible 
representations that have these as highest weights are called the fundamental 
representations. We often denote them by $D^j$. Note that the superscripts are 
just labels. The vectors also have vector indices. (Both run from 1 to m, which 
can be confusing.) 
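
For a rank 2 algebra the defining condition for the fundamental weights is a pair of 2 x 2 linear systems, which can be solved by hand or by a few lines of code. The sketch assumes the condition $2\alpha^j \cdot \mu^k / \alpha^{j2} = \delta_{jk}$ and the SU(3) simple roots $(1/2, \sqrt{3}/2)$ and $(1/2, -\sqrt{3}/2)$; the function name is illustrative.

```python
import math


def fundamental_weights(a1, a2):
    """Solve 2 a^j . mu^k / (a^j)^2 = delta_jk for two simple roots in the plane."""
    rows = []
    for a in (a1, a2):
        norm2 = a[0] ** 2 + a[1] ** 2
        rows.append((2 * a[0] / norm2, 2 * a[1] / norm2))
    (m11, m12), (m21, m22) = rows
    det = m11 * m22 - m12 * m21
    # The fundamental weights are the columns of the inverse of the matrix of rows.
    mu1 = (m22 / det, -m21 / det)
    mu2 = (-m12 / det, m11 / det)
    return mu1, mu2


R3 = math.sqrt(3)
mu1, mu2 = fundamental_weights((0.5, R3 / 2), (0.5, -R3 / 2))

# Highest weight with Dynkin coefficients (1, 1).
highest_11 = (mu1[0] + mu2[0], mu1[1] + mu2[1])
print(mu1, mu2, highest_11)
```

The first fundamental weight coincides with the highest weight of the triplet, and the combination with Dynkin coefficients (1, 1) is the highest root $\alpha^1 + \alpha^2$ of the adjoint.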

Now running the previous arguments backward gives 


that is, $\ell^j$ is the $q^j - p^j$ value of the simple root $\alpha^j$. 


The Weyl Group 


There is a symmetry that appears because there is an SU(2) associated with 
each root direction and all SU(2) representations are symmetrical under the 
reflection $E_3 \to -E_3$. If $\mu$ is a weight and $E_3 = \alpha \cdot H / \alpha^2$ is the $E_3$ associated 
with the root $\alpha$, then 

    E_3 |\mu\rangle = \frac{\alpha \cdot \mu}{\alpha^2} |\mu\rangle    (6.40)

and the reflection symmetry implies that $\mu - (q - p)\alpha$ (where $q - p = 2\alpha \cdot \mu / \alpha^2$) 
is a weight with the opposite $E_3$ value. There are such reflections for all roots; 
they are transformations on the weight space that leave the set of roots unchanged. 

The set of all such transformations obtained in this way forms a transfor- 
mation group called the Weyl group of the algebra. The individual reflections 
are called Weyl reflections. The Weyl group is a simple way of understanding 
the hexagonal and triangular structures that appear in SU(3) representations. 
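
The reflections are simple to implement, and repeating them generates the triangle and hexagon directly. The sketch assumes the reflection $\mu \to \mu - (2\alpha \cdot \mu / \alpha^2)\alpha$ and uses only the two SU(3) simple roots as generators; the function names and the rounding used to identify equal points are illustrative choices.

```python
import math


def reflect(mu, alpha):
    """Weyl reflection of the weight mu in the hyperplane orthogonal to alpha."""
    c = 2 * (mu[0] * alpha[0] + mu[1] * alpha[1]) / (alpha[0] ** 2 + alpha[1] ** 2)
    return (mu[0] - c * alpha[0], mu[1] - c * alpha[1])


def orbit(mu, simple_roots, digits=9):
    """All images of mu under repeated reflections in the simple roots."""
    seen = {}
    todo = [mu]
    while todo:
        w = todo.pop()
        key = (round(w[0], digits), round(w[1], digits))
        if key in seen:
            continue
        seen[key] = w
        todo.extend(reflect(w, a) for a in simple_roots)
    return sorted(seen)


R3 = math.sqrt(3)
SIMPLE = [(0.5, R3 / 2), (0.5, -R3 / 2)]

triangle = orbit((0.5, R3 / 6), SIMPLE)   # start from the highest weight of the triplet
hexagon = orbit((1.0, 0.0), SIMPLE)       # start from the highest root
print(len(triangle), len(hexagon))
```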


Young Tableaux 


You may have met Young tableaux in discussions of irreducible 
representations of the symmetric groups. We will now see that they are useful for dealing 
with the irreducible representations of Lie groups. We will begin by discussing 
this for SU(3) but the real advantage is that it generalizes to higher groups. 


Raising the Indices 


The crucial observation is that the $\bar{3}$ representation is an antisymmetric 
combination of two 3 representations, so we do not need the second fundamental 
representation to construct higher representations. We can write an arbitrary 
representation as a tensor product of 3's with appropriate symmetry. In fact, 
irreducible representations of SU(3) transform irreducibly under 
permutation of the indices. Consider a general representation (n, m). It is a tensor with 
components 

    A^{i_1 \cdots i_n}_{j_1 \cdots j_m}

separately symmetric in upper and lower indices and traceless. We can raise 
all the lower indices with $\epsilon$ tensors to get 

    a^{i_1 \cdots i_n k_1 \ell_1 \cdots k_m \ell_m} = \epsilon^{j_1 k_1 \ell_1} \cdots \epsilon^{j_m k_m \ell_m} A^{i_1 \cdots i_n}_{j_1 \cdots j_m}.    (6.41)

Clearly, it is antisymmetric in each pair, $k_i \leftrightarrow \ell_i$, and symmetric in the 
exchange of pairs, $k_i, \ell_i \leftrightarrow k_j, \ell_j$. Now for each such tensor, we can associate 
a Young tableau (Figure 6.6). 

Think about the highest weight of the representation, (n, m). Because the 
lowering operators preserve the symmetry, if we find the symmetry of the 
tensor components describing the highest weight, all the states will have that 
symmetry. The highest weight is associated with the components in which 
all the i's are 1 and all the k, $\ell$ pairs are 1, 3. All of these can be obtained 
by antisymmetrizing the k, $\ell$ pairs from the component in which all the i's and k's 
are one, and all the $\ell$'s are three. But this one component is symmetric under 
arbitrary permutations of the i's and k's, and of the $\ell$'s. Thus, we will obtain a 
tensor with the right symmetry if we start with an arbitrary tensor with n + 2m 
components and first symmetrize all the i's and k's and separately the $\ell$'s, and 
then antisymmetrize in every k, $\ell$ pair. 


FIGURE 6.6 
The Young tableau of the (m, n) representation. 


FIGURE 6.7 
The empty Young tableau of the (2, 1) representation. 


FIGURE 6.8 
The eight-dimensional Young tableau. 


FIGURE 6.9 
The three-dimensional Young tableau. 


In the Young tableau language we first symmetrize in the components in 
the rows then antisymmetrize in the components in the columns. The result 
is symmetric in the i's and in the k, $\ell$ pairs. It is also traceless. 

In SU(3) the tensors corresponding to Young tableaux with more than three
boxes in any column vanish. There are simple rules that allow us to calculate
the dimension of multiplets from the corresponding Young tableaux. Consider
the tableau of Figure 6.7 in SU(3). First we put the 3 of SU(3) in the top left
corner, then increase by unity across each row and decrease by unity down each
column (Figure 6.8). This works for any SU(n), starting from n. Then multiply
these numbers, so that we get 3 x 4 x 2 = 24.

Now put the hook lengths into the boxes. A hook enters a column from below,
runs up to the relevant box, turns right there, and leaves along the row; the
hook length is the number of boxes it passes through. Here we get the result in
Figure 6.9 and multiplying gives 3 x 1 x 1 = 3. The dimension of the multiplet
is the quotient of these two numbers. Here, 24 divided by 3 gives 8.
Similar methods work for all the Lie groups except for the exceptional
ones that do not coincide with nonexceptional ones.
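
The counting rule above can be sketched in a few lines of Python. This is only an illustration: the function name `su_n_dimension` and the convention of describing a tableau by its list of row lengths are assumptions made here, not notation from the text.

```python
def su_n_dimension(shape, n):
    """Dimension of the SU(n) multiplet for a Young tableau.

    shape: row lengths from top to bottom, e.g. [2, 1] (hypothetical convention).
    """
    if not shape:
        return 1  # empty tableau: the singlet
    # Column heights: how many rows reach each column.
    cols = [sum(1 for r in shape if r > c) for c in range(shape[0])]
    numerator = 1
    hooks = 1
    for i, row_len in enumerate(shape):
        for j in range(row_len):
            numerator *= n + j - i       # n in the top-left box, +1 across, -1 down
            arm = row_len - j - 1        # boxes to the right in the same row
            leg = cols[j] - i - 1        # boxes below in the same column
            hooks *= arm + leg + 1       # hook length of this box
    return numerator // hooks


print(su_n_dimension([2, 1], 3))        # 24 / 3 -> the octet
print(su_n_dimension([1, 1, 1, 1], 3))  # four boxes in a column vanish in SU(3)
```

For the tableau with row lengths [2, 1] the two products are 24 and 3, reproducing the quotient 8 quoted above.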


The Classification Theorem (Dynkin) 
See chapter 20, p. 244 of Lie Algebras in Particle Physics (2nd ed.) by H. Georgi. 


Result 


Coincidences 


1. $A_1$, $B_1$, $C_1$ are all SU(2).

2. $B_2 = C_2$.

3. $D_3 = A_3$.

4. Remove one more circle from $D_3$ to get $D_2$ and it falls apart into two
disconnected circles (the middle one must be removed to stay in the
$D_n$ family). Thus, $D_2$ is not simple. This is the important statement
that the algebra of SO(4) is the same as the algebra of SU(2) x SU(2).


This is the complete list of such coincidences.


FIGURE 6.10
The four general dimensional Dynkin classes and the five exceptional Dynkin classes.
Problems


6.1 Show that $[E_\alpha, E_{-\alpha}] = \alpha\cdot H$, which should remind you of the SU(2)
commutation relation $[J^+, J^-] = J_3$. This analogy will be exploited
to learn more about the representations of compact Lie groups. Generally,
for any weight $\mu$ of a representation $D$, the $E_3$ value is given
by

$$E_3|\mu, x, D\rangle = \frac{\alpha\cdot\mu}{\alpha^2}\,|\mu, x, D\rangle .$$

Because the $E_3$ values must be integers or half integers,

$$\frac{2\alpha\cdot\mu}{\alpha^2} = \text{integer}.$$


6.2 Show that

$$\frac{\alpha\cdot\mu}{\alpha^2} = -\tfrac{1}{2}(p - q)$$

where $\mu + p\alpha$ is the weight of the highest $E_3$ state of the SU(2) spin $j$
representation and $q$ plays the corresponding role when lowering.


6.3 We can easily make use of these results. Apply Equation (6.13) to two
distinct roots $\alpha$ and $\beta$ and we find (using $E_\alpha$ as the SU(2) definition)

$$\frac{\alpha\cdot\beta}{\alpha^2} = -\tfrac{1}{2}(p - q).$$

Again, now using $E_\beta$ yields instead

$$\frac{\beta\cdot\alpha}{\beta^2} = -\tfrac{1}{2}(p' - q').$$


6.4 If $\theta_{\alpha\beta}$ is the angle between the roots $\alpha$ and $\beta$ then show that
multiplying the last two results gives

$$\cos^2(\theta_{\alpha\beta}) = \frac{(\alpha\cdot\beta)^2}{\alpha^2\beta^2} = \frac{(p - q)(p' - q')}{4}.$$


6.5 These vectors, plotted in a plane, form the vertices of an equilateral
triangle. Show that this is true.
[Weight diagram: the three weights $(-1/2, \sqrt{3}/6)$, $(1/2, \sqrt{3}/6)$, and $(0, -\sqrt{3}/3)$ plotted in the $(H_1, H_2)$ plane.]
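
As a quick numerical cross-check of Problem 6.5 (not a substitute for the algebra), the following sketch measures the three side lengths for the weights quoted in the diagram and also forms their differences, which are the roots. The variable and function names are illustrative assumptions.

```python
import math

# The three weights of the diagram, in the (H_1, H_2) plane.
weights = [
    (-0.5, math.sqrt(3) / 6),
    (0.5, math.sqrt(3) / 6),
    (0.0, -math.sqrt(3) / 3),
]


def distance(p, q):
    """Euclidean distance between two points of the weight plane."""
    return math.hypot(p[0] - q[0], p[1] - q[1])


# Side lengths of the triangle formed by the weights.
sides = [distance(weights[i], weights[j]) for i, j in [(0, 1), (1, 2), (2, 0)]]

# Roots are differences of distinct weights; there are six of them.
roots = [
    (w[0] - v[0], w[1] - v[1])
    for w in weights
    for v in weights
    if w is not v
]
root_lengths = [math.hypot(r[0], r[1]) for r in roots]

print(sides)
print(root_lengths)
```

All three sides come out equal to one, and the six differences all have the same length, as expected for the corners of a regular hexagon.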


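6.6 The roots are differences of weights, because the corresponding
generators take one weight to another. The generators clearly have
only one off-diagonal entry each and can be written as

$$\frac{1}{\sqrt{2}}(T_1 \pm iT_2) = E_{\pm 1,\,0}$$

$$\frac{1}{\sqrt{2}}(T_4 \pm iT_5) = E_{\pm 1/2,\,\pm\sqrt{3}/2}$$

$$\frac{1}{\sqrt{2}}(T_6 \pm iT_7) = E_{\mp 1/2,\,\pm\sqrt{3}/2}$$

where the plus and minus signs are correlated. The roots form a
regular hexagon, plotted along with the two elements of the Cartan
subalgebra in the center:

[Root diagram: the six roots at the corners of a regular hexagon in the $(H_1, H_2)$ plane, with the two Cartan generators at the center.]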


6.7 Show that 
«a-b EZ 
a? = ( bs 
6.8 Show that 
Bra vp 
m wos 


6.9 The reader may enjoy the simple exercise of showing that the simple
roots are linearly independent. By exploiting the completeness of
the simple roots, it is easy to find all of the roots.
6.10


6.11 


6.12 


6.13 
$$2E_3|\mu\rangle = \frac{2H\cdot\alpha^i}{(\alpha^i)^2}|\mu\rangle = (q^i - p^i)|\mu\rangle ,$$

If $A$ is the Cartan matrix, then Equation (6.30) together with the form
of the Cartan matrix

$$A_{ji} = \frac{2\alpha^j\cdot\alpha^i}{(\alpha^i)^2}$$

will give us all the same information as the Dynkin diagram. For
SU(3), the Cartan matrix has the form

$$A = \begin{pmatrix} 2 & -1 \\ -1 & 2 \end{pmatrix}.$$

The action of the raising operator $E_{\alpha^j}$ moves $\phi$ to $\phi + \alpha^j$. This just
changes $k_j$ to $k_j + 1$ and thus $q^i - p^i$ to $q^i - p^i + A_{ji}$, that is, $k_j \to k_j + 1$.


Start with the simple roots in the $q - p$ notation. Put each in a
rectangular box and arrange them on a horizontal line representing
the $k = 1$ layer of positive roots, that is, the simple roots.


k = 1    [2 -1]  [-1 2]    $\alpha^1$, $\alpha^2$


Now put a box with $m$ zeros, representing the Cartan generators,
on a line below, representing the $k = 0$ layer.


k = 0    [0 0]    $H_i$


Now for each element of each box we know the $q^i$ value. For the
$i$th element of $\alpha^i$, $q^i = 2$ because the root is part of the SU(2) spin
1 representation, consisting of $E_{\pm\alpha^i}$ and $\alpha^i\cdot H$. For all the other
elements, $q^i = 0$ because $\alpha^j - \alpha^i$ is not a root.


Thus


p =      [0 1]   [1 0]


k = 1    [2 -1]  [-1 2]    $\alpha^1$, $\alpha^2$


k = 0    [0 0]    $H_i$
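
The Cartan matrix quoted for SU(3) can be checked directly from a pair of simple roots. The sketch below assumes the simple roots $\alpha^1 = (1/2, \sqrt{3}/2)$ and $\alpha^2 = (1/2, -\sqrt{3}/2)$, a common choice that is not stated in this section, and uses a hypothetical helper `cartan_matrix`.

```python
import math


def cartan_matrix(simple_roots):
    """A[j][i] = 2 alpha^j . alpha^i / (alpha^i)^2 for a list of simple roots."""
    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))

    return [
        [2 * dot(aj, ai) / dot(ai, ai) for ai in simple_roots]
        for aj in simple_roots
    ]


# Assumed simple roots of SU(3) (unit length, 120 degrees apart).
alpha1 = (0.5, math.sqrt(3) / 2)
alpha2 = (0.5, -math.sqrt(3) / 2)

A = cartan_matrix([alpha1, alpha2])
rounded = [[round(x) for x in row] for row in A]
print(rounded)
```

Each row of the result lists the $q^i - p^i$ values of one simple root, which is exactly what goes into the boxes of the $k = 1$ layer.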
6.15


6.16 


6.17 


6.18 


6.19 


Hence, for every $E_{\alpha^j}$ acting on $|\mu\rangle$, $p = 0$ and thus

$$\frac{2\alpha^j\cdot\mu}{(\alpha^j)^2} = \ell^j$$

where the $\ell^j$ are nonnegative integers. The $\ell^j$ completely determine
$\mu$. Every set of $\ell^j$ gives a $\mu$, which is the highest weight of some
irreducible representation. Hence the irreducible representations of
rank $m$ simple Lie algebras can be labeled by a set of $m$ nonnegative
integers $\ell^j$. These integers are called the Dynkin coefficients.
Consider the weight vectors, $\mu^j$, satisfying

$$\frac{2\alpha^j\cdot\mu^k}{(\alpha^j)^2} = \delta_{jk}.$$

Every highest weight can be written uniquely as

$$\mu = \sum_{j=1}^{m} \ell^j \mu^j .$$

Now running the previous arguments backward gives


that is, $\ell^j$ is the $q^j - p^j$ value of the simple root $\alpha^j$.


The crucial observation is that the $\bar{3}$ representation is an antisymmetric
combination of two 3 representations, so we do not need the
second fundamental representation to construct higher representations.
Show in your own notation how to do this.


Now for each such tensor, we can associate a Young tableau. In your 
own words show how this works. 


In SU(3) the tensors corresponding to Young tableaux with more
than three boxes in any column vanish. There are simple rules that
allow us to calculate the dimension of multiplets from the corresponding
Young tableaux. Consider the tableau

[Young tableau: first row $k_1, \ldots, k_m, i_1, \ldots, i_n$; second row $\ell_1, \ldots, \ell_m$]

in SU(3). In your own notation show how this works.
Noether's Theorem and Gauge Theories
of the First and Second Kinds


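Perhaps the main point of the Lagrangian formalism is that it provides a natural
framework for the quantum mechanical implementation of symmetries. This
is because the dynamical equations in the Lagrangian formalism take the form
of a variational principle, the principle of stationary action. Consider
any infinitesimal transformation of the fields

$$\Psi^\ell(x) \to \Psi^\ell(x) + i\epsilon F^\ell(x) \tag{7.1}$$

which leaves the action

$$I[\Psi] = \int dt\, L[\Psi(t), \dot\Psi(t)] \tag{7.2}$$

invariant. Under an arbitrary variation of $\Psi(x)$ we get

$$\delta I[\Psi] = \int_{-\infty}^{\infty} dt \int d^3x \left[ \frac{\delta L}{\delta\Psi^\ell(x)}\,\delta\Psi^\ell(x) + \frac{\delta L}{\delta\dot\Psi^\ell(x)}\,\delta\dot\Psi^\ell(x) \right]. \tag{7.3}$$

Now assume that $\delta\Psi^\ell(x)$ vanishes for $t \to \pm\infty$ so that we may integrate by
parts, and write

$$\delta I[\Psi] = \int d^4x \left[ \frac{\delta L}{\delta\Psi^\ell(x)} - \frac{d}{dt}\frac{\delta L}{\delta\dot\Psi^\ell(x)} \right] \delta\Psi^\ell(x). \tag{7.4}$$

We see that the action is stationary with respect to all variations $\delta\Psi^\ell$ that
vanish at $t \to \pm\infty$ if and only if the fields satisfy the field equations

$$\dot\Pi_\ell(\mathbf{x}, t) = \frac{\delta L[\Psi(t), \dot\Psi(t)]}{\delta\Psi^\ell(\mathbf{x}, t)}, \qquad \Pi_\ell(\mathbf{x}, t) \equiv \frac{\delta L[\Psi(t), \dot\Psi(t)]}{\delta\dot\Psi^\ell(\mathbf{x}, t)}. \tag{7.5}$$

Notice that we could have come at this from a different, and perhaps more
familiar, point starting with a Lagrangian as a function of a set of generic
fields $\Psi^\ell(\mathbf{x}, t)$ and their time derivatives $\dot\Psi^\ell(\mathbf{x}, t)$, when the conjugate fields
$\Pi_\ell(\mathbf{x}, t)$ are defined as the variational derivatives of $L$ with respect to $\dot\Psi^\ell(\mathbf{x}, t)$.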
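For all such schemes we can introduce the Lagrangian density, $\mathcal{L}$, a scalar
function of $\Psi(x)$ and $\partial\Psi(x)/\partial x^\mu$, so that the action is

$$I[\Psi] = \int d^4x\, \mathcal{L}\!\left(\Psi(x), \frac{\partial\Psi(x)}{\partial x^\mu}\right). \tag{7.6}$$

All field theories used in current theories of elementary particles have
Lagrangians of this form. Varying $\Psi^\ell(x)$ by an amount $\delta\Psi^\ell(x)$ and integrating
by parts we find the variation in $L$ is

$$\delta L = \int d^3x \left[ \left( \frac{\partial\mathcal{L}}{\partial\Psi^\ell} - \nabla\cdot\frac{\partial\mathcal{L}}{\partial(\nabla\Psi^\ell)} \right)\delta\Psi^\ell + \frac{\partial\mathcal{L}}{\partial\dot\Psi^\ell}\,\delta\dot\Psi^\ell \right] \tag{7.7}$$

so that with obvious arguments suppressed we get

$$\frac{\delta L}{\delta\dot\Psi^\ell} = \frac{\partial\mathcal{L}}{\partial\dot\Psi^\ell}. \tag{7.8}$$

The field equations now read

$$\frac{\partial}{\partial x^\mu}\frac{\partial\mathcal{L}}{\partial(\partial\Psi^\ell/\partial x^\mu)} = \frac{\partial\mathcal{L}}{\partial\Psi^\ell}. \tag{7.9}$$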

These are known as the Euler-Lagrange equations. As expected, if $\mathcal{L}$ is a scalar,
these equations are Lorentz invariant. In addition to being Lorentz invariant,
the action $I$ is required to be real. This is because we want just as many
field equations as there are fields. This reality condition also ensures that the
generators of various symmetry transformations are Hermitian operators.
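
As a minimal worked instance of the Euler-Lagrange equations, consider a single free real scalar field. The Lagrangian density below and the metric signature (spatial components positive) are assumptions chosen for illustration; they are not taken from this chapter.

```latex
% Assumed example: free real scalar field, signature with spatial components positive
\mathcal{L} = -\tfrac{1}{2}\,\partial_\mu\phi\,\partial^\mu\phi - \tfrac{1}{2}\,m^2\phi^2
\quad\Longrightarrow\quad
\frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi)} = -\partial^\mu\phi ,
\qquad
\frac{\partial\mathcal{L}}{\partial\phi} = -m^2\phi .
% Inserting both pieces into the Euler-Lagrange equation:
\partial_\mu\!\left(-\partial^\mu\phi\right) = -m^2\phi
\quad\Longrightarrow\quad
\left(\partial_\mu\partial^\mu - m^2\right)\phi = 0 .
```

One real field gives one real field equation, in line with the counting argument for a real action.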

We now come to the real point of the Lagrangian formalism, namely that it
provides a natural framework for the quantum mechanical interpretation of
symmetry principles. This is because the dynamical equations in the Lagrangian
formalism take the form of a variational principle, the principle of stationary
action. Consider any infinitesimal transformation of the fields

$$\Psi^\ell(x) \to \Psi^\ell(x) + i\epsilon F^\ell(x) \tag{7.10}$$

that leaves the action invariant. If $\epsilon$ is constant, such symmetries are known
as global symmetries. Of course, the action is invariant if the fields satisfy the
dynamical equations. By an infinitesimal symmetry transformation we mean
one that leaves the action invariant even when the dynamical equations are
not satisfied. Now consider the same transformation with $\epsilon$ an arbitrary
function of position in space-time; then, in general, the variation of the action will
not vanish. But it will have to be of the form

$$\delta I = -\int d^4x\, J^\mu(x)\,\frac{\partial\epsilon(x)}{\partial x^\mu} \tag{7.11}$$
in order that it should vanish when $\epsilon(x)$ is constant. If we now take the
fields in $I[\Psi]$ to satisfy the field equations, then $I$ is stationary with respect
to arbitrary field variations that vanish at large space-time distances. These
include variations of the form in Equation (7.10) with $\epsilon$ a function of
position, so in this case $\delta I$ should vanish. Integrating by parts, we see that
$J^\mu(x)$ must satisfy a conservation law

$$\frac{\partial J^\mu(x)}{\partial x^\mu} = 0. \tag{7.12}$$

It follows that

$$\frac{dF}{dt} = 0 \qquad\text{where}\qquad F = \int d^3x\, J^0. \tag{7.13}$$


There is one such conserved current $J^\mu$ and one constant of the motion $F$ for
each independent infinitesimal symmetry transformation. This represents a
general feature of the canonical formalism, often referred to as Noether's
theorem: symmetries imply conservation laws. This theorem [1] is cited in the
original German and in the English translation, which Einstein is known to
have encouraged strongly. (Note that this theorem is by a woman author
working alone when such things were far from easy.)
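
To see the argument at work in one concrete case, here is a sketch for a complex scalar field with a phase symmetry. The Lagrangian density is an assumption chosen for illustration (it is not one of this chapter's examples); the current is read off from $\delta I = -\int d^4x\, J^\mu \partial_\mu\epsilon$.

```latex
% Assumed example: complex scalar field with the phase symmetry F = \phi
\mathcal{L} = -\partial_\mu\phi^\dagger\,\partial^\mu\phi - m^2\phi^\dagger\phi ,
\qquad
\phi \to \phi + i\epsilon(x)\,\phi , \quad \phi^\dagger \to \phi^\dagger - i\epsilon(x)\,\phi^\dagger .
% Terms proportional to \epsilon cancel; only derivatives of \epsilon survive:
\delta I = \int d^4x\; i\left(\phi^\dagger\partial^\mu\phi - \phi\,\partial^\mu\phi^\dagger\right)\partial_\mu\epsilon
\quad\Longrightarrow\quad
J^\mu = i\left(\phi\,\partial^\mu\phi^\dagger - \phi^\dagger\partial^\mu\phi\right) .
% Using the field equations \partial_\mu\partial^\mu\phi = m^2\phi and its conjugate:
\partial_\mu J^\mu = i\left(\phi\,\partial_\mu\partial^\mu\phi^\dagger - \phi^\dagger\,\partial_\mu\partial^\mu\phi\right) = 0 .
```

The conserved quantity $F = \int d^3x\, J^0$ is then the charge carried by the field.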

Now we turn our attention to the treatment of first and second class
constraints and Dirac brackets [2]. The main obstacle to deriving the
Hamiltonian from the Lagrangian is the presence of constraints. Primary constraints
are either imposed on the system (a good example is in picking a gauge for
the electromagnetic field) or arise from the structure of the Lagrangian itself.
A good example is found by considering the Lagrangian of a massive vector
field $V^\mu$ interacting with a current $J_\mu$, where we have

$$\mathcal{L} = -\tfrac{1}{4}F_{\mu\nu}F^{\mu\nu} - \tfrac{1}{2}m^2 V_\mu V^\mu + J_\mu V^\mu \tag{7.14}$$

where

$$F_{\mu\nu} = \partial_\mu V_\nu - \partial_\nu V_\mu . \tag{7.15}$$

To treat all indices on the same basis we define the conjugates

$$\Pi^\mu = \frac{\partial\mathcal{L}}{\partial\dot V_\mu} = -F^{0\mu}. \tag{7.16}$$

We find the primary constraint

$$\Pi^0 = 0. \tag{7.17}$$

Primary constraints are found when the equations

$$\Pi_\ell = \frac{\delta L}{\delta(\partial_0\Psi^\ell)} \tag{7.18}$$

cannot be solved to give all the $\partial_0\Psi^\ell$ (at least locally) in terms of $\Pi_\ell$ and
$\Psi^\ell$. This will be the case if and only if the matrix of second partial
derivatives of the Lagrangian with respect to the time derivatives $\partial_0\Psi^\ell$
has a vanishing determinant. Such Lagrangians are called irregular.

Then there are secondary constraints, which arise from the requirement that
the primary constraints be consistent with the equations of motion. For the
massive vector field, this is just the Euler-Lagrange equation for $V^0$

$$\partial_i\Pi^i = m^2 V^0 - J^0 . \tag{7.19}$$

There are many variations on this theme but we do not need them here.
Much more important for us is the distinction between first and second class
constraints. The constraints we have found for the massive vector field are of
a type known as second class, for which there is a universal prescription for
commutation relations.

To explain the distinction between first and second class constraints, we recall
the definition of the Poisson brackets of classical mechanics. Consider any
Lagrangian $L(\Psi, \dot\Psi)$ that depends on a set of variables $\Psi^a(t)$ and their time
derivatives $\dot\Psi^a(t)$. We define canonical conjugates for all of these variables by

$$\Pi_a = \frac{\partial L}{\partial\dot\Psi^a}. \tag{7.20}$$

The $\Pi$s and $\Psi$s will in general not be independent variables, but may instead
be related by various constraint equations, both primary and secondary. The
Poisson bracket is then defined by

$$[A, B]_P = \frac{\partial A}{\partial\Psi^a}\frac{\partial B}{\partial\Pi_a} - \frac{\partial B}{\partial\Psi^a}\frac{\partial A}{\partial\Pi_a} \tag{7.21}$$

with the constraints ignored in calculating the derivatives. In particular, we
always have

$$[\Psi^a, \Pi_b]_P = \delta^a_b \tag{7.22}$$

where from now on all fields are taken at the same time and time arguments are
everywhere dropped. We call a constraint first class if its Poisson bracket with
all the other constraints vanishes when (after calculating the Poisson brackets)
we impose the constraints. Such constraints can always be eliminated by a
choice of gauge.

After all of the first class constraints have been eliminated by a choice of
gauge, the remaining constraint equations

$$\chi_N = 0 \tag{7.23}$$

are such that no linear combination of the Poisson brackets of these constraints
with each other vanishes. It follows that the matrix $C$ of the Poisson brackets
of the remaining constraints is nonsingular:

$$\mathrm{Det}\, C \neq 0 \tag{7.24}$$

where

$$C_{NM} = [\chi_N, \chi_M]_P . \tag{7.25}$$

Constraints of this sort are called second class. There must always be an even
number of second class constraints, because an antisymmetric matrix of odd
dimensionality has to have a vanishing determinant.
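
The determinant statement follows in one line from the antisymmetry of the Poisson bracket. Writing $n$ for the number of constraints (a symbol introduced here only for this step):

```latex
% C is antisymmetric, C^T = -C, because [\chi_N,\chi_M]_P = -[\chi_M,\chi_N]_P
\mathrm{Det}\,C = \mathrm{Det}\,C^{T} = \mathrm{Det}(-C) = (-1)^{n}\,\mathrm{Det}\,C
\quad\Longrightarrow\quad
\mathrm{Det}\,C = 0 \ \text{ for odd } n .
```

A nonsingular $C$ therefore requires $n$ to be even.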

Dirac suggested that when all constraints are second class, the commutators
will be given by a simple modification of the Poisson bracket, which he called
the Dirac bracket. A powerful theorem by Maskawa and Nakajima
[3] was used to examine the Dirac bracket and its properties, but the issue
appears to remain unresolved.
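
For reference, the standard form of Dirac's modification is sketched below in the notation of Equations (7.21) and (7.25). The formula is quoted from the general literature on constrained systems rather than derived in this chapter.

```latex
% Standard definition of the Dirac bracket for second class constraints \chi_N
[A, B]_D = [A, B]_P - [A, \chi_N]_P\,(C^{-1})^{NM}\,[\chi_M, B]_P .
% Its defining property: every second class constraint has vanishing Dirac bracket
[A, \chi_K]_D = [A, \chi_K]_P - [A, \chi_N]_P\,(C^{-1})^{NM}\,C_{MK} = 0 .
```

Because the Dirac bracket of anything with a constraint vanishes, the constraints can be imposed before or after the brackets are evaluated.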
Problems


7.1 Read from Equation (7.1) through Equation (7.5). Close the book. 
Now write out your own version of this section. 


7.2 Read the section immediately after Equation (7.5). Follow the sug- 
gestion and work out the calculation in this alternative manner. 


7.3 Using the action in Equation (7.6), integrate by parts to find the 
Euler-Lagrange equations as given in Equation (7.9). 


7.4 In your own words explain what you understand by the principle
of stationary action.


7.5 Starting from Equation (7.10) work, with the book closed, until you 
reach the result in Equation (7.14). 


7.6 Express the result of Problem 7.5 in simple English. 
7.7 Starting from the definition of the Poisson brackets of classical
mechanics, explain what you understand by primary and secondary
constraints.


7.8 Explain why there must always be an even number of second class
constraints.


7.9 Explain what Dirac meant by the Dirac bracket.


7.10 Look up the paper by Maskawa and Nakajima. Explain in your own 
terms anything you understand. 
Basic Couplings of the Electromagnetic,
Weak, and Strong Interactions


We start yet again with electromagnetism, which we examined in some detail
in Chapter 7. Since $A^0$ is not an independent Heisenberg-picture field variable
we do not introduce any corresponding operator $a^0$ in the interaction picture,
but rather take

$$a^0 = 0. \tag{8.1}$$


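The most general real solution may be written

$$a^\mu(x) = (2\pi)^{-3/2}\int\frac{d^3p}{\sqrt{2p^0}}\sum_\sigma\left[e^{ip\cdot x}\,e^\mu(\mathbf{p},\sigma)\,a(\mathbf{p},\sigma) + e^{-ip\cdot x}\,e^{\mu *}(\mathbf{p},\sigma)\,a^\dagger(\mathbf{p},\sigma)\right]. \tag{8.2}$$

We can easily see that the canonical commutation relations are satisfied if
(and in fact only if) the operator coefficients in Equation (8.2) satisfy

$$[a(\mathbf{p},\sigma), a^\dagger(\mathbf{p}',\sigma')] = \delta^3(\mathbf{p} - \mathbf{p}')\,\delta_{\sigma\sigma'} \tag{8.3}$$

$$[a(\mathbf{p},\sigma), a(\mathbf{p}',\sigma')] = 0. \tag{8.4}$$

The free photon Hamiltonian takes the expected form. The general Feynman
rules yield the photon propagator

$$-i\Delta_{\mu\nu}(x - y) = \int\frac{d^3p}{(2\pi)^3\,2|\mathbf{p}|}\,P_{\mu\nu}(\mathbf{p})\left[e^{ip\cdot(x-y)}\,\theta(x - y) + e^{ip\cdot(y-x)}\,\theta(y - x)\right], \tag{8.5}$$

where

$$P_{\mu\nu}(\mathbf{p}) = \sum_{\sigma=\pm 1} e_\mu(\mathbf{p},\sigma)\,e_\nu(\mathbf{p},\sigma)^* \tag{8.6}$$

and $p^\mu$ in the exponentials is taken with

$$p^0 = |\mathbf{p}|. \tag{8.7}$$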
From a practical point of view, the important thing is that in the momentum
space Feynman rules, the contribution of an internal photon line is simply
given by

$$\frac{-i}{(2\pi)^4}\,\frac{\eta_{\mu\nu}}{q^2 - i\epsilon} \tag{8.8}$$

and the Coulomb interaction is dropped. (We have not even given a
hand-waving argument for this.) It can be justified by a detailed analysis of
Feynman diagrams but the easiest way to treat this problem is by path integral
methods.

We can now state the Feynman rules for calculating the S-matrix in quantum
electrodynamics. For simplicity we take a single type of spin $\tfrac{1}{2}$ particle
of charge $q = -e$ and mass $m$. The simplest gauge invariant and Lorentz
invariant Lagrangian for this theory is

$$\mathcal{L} = -\tfrac{1}{4}F^{\mu\nu}F_{\mu\nu} - \bar\Psi\left(\gamma^\mu[\partial_\mu + ieA_\mu] + m\right)\Psi . \tag{8.9}$$

The electric current four-vector is then

$$J^\mu = \frac{\partial\mathcal{L}}{\partial A_\mu} = -ie\,\bar\Psi\gamma^\mu\Psi . \tag{8.10}$$

It should now be obvious to the reader how the spin $\tfrac{1}{2}$ particles enter using
the familiar Pauli matrices.

The reader will also know how in the early 1960s Gell-Mann [1] extended
the approximate SU(2) isospin symmetry of nuclear physics to an even less
exact SU(3) symmetry, which grouped the partially known baryons and
mesons into octets and decuplets. These are now known as:

• An octet of $\tfrac{1}{2}^+$ baryons $p$, $n$, $\Lambda^0$, $\Sigma^{+,0,-}$, $\Xi^{0,-}$

• An octet of $0^-$ mesons $K^{+,0}$, $\pi^{+,0,-}$, $\eta^0$, $\bar{K}^{0,-}$

• An octet of $1^-$ mesons $K^{*+,0}$, $\rho^{+,0,-}$, $\omega$, $\bar{K}^{*0,-}$

• A decuplet of $\tfrac{3}{2}^+$ baryons $\Delta^{++,+,0,-}$, $\Sigma^{*+,0,-}$, $\Xi^{*0,-}$, $\Omega^-$


After the successes of the chiral SU(2) x SU(2) symmetry in the mid-1960s,
it was natural to suppose that the strong interactions also respect an
approximate SU(3) x SU(3) symmetry, which like SU(2) x SU(2) is spontaneously
broken to its diagonal subgroup, the Gell-Mann SU(3). Quantum chromodynamics
revealed that this symmetry arises because there are not merely two fairly light
quarks, the u and the d, but also a third one, the s, which has the same charge
as the d and is still fairly light. This means that the SU(3) x SU(3) symmetry
consists of independent SU(3) transformations on the left- and right-handed
parts of the u, d, s quark fields:

$$\begin{pmatrix} u \\ d \\ s \end{pmatrix} \to \exp\left[i\sum_a\left(\theta^V_a\lambda_a + \theta^A_a\lambda_a\gamma_5\right)\right]\begin{pmatrix} u \\ d \\ s \end{pmatrix} \tag{8.11}$$

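where the $\lambda_a$ are the complete set of traceless Hermitian (3 x 3) matrices:

$$\lambda_1 = \begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad
\lambda_2 = \begin{pmatrix} 0 & -i & 0 \\ i & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad
\lambda_3 = \begin{pmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 0 \end{pmatrix},$$

$$\lambda_4 = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{pmatrix}, \quad
\lambda_5 = \begin{pmatrix} 0 & 0 & -i \\ 0 & 0 & 0 \\ i & 0 & 0 \end{pmatrix}, \quad
\lambda_6 = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix},$$

$$\lambda_7 = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & -i \\ 0 & i & 0 \end{pmatrix}, \quad
\lambda_8 = \frac{1}{\sqrt{3}}\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -2 \end{pmatrix} \tag{8.12}$$

with $\mathrm{Tr}(\lambda_a\lambda_b) = 2\delta_{ab}$ as normalization. This general scheme does seem in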
one way or another to have continued to expand, so that we now have also 
a charmed quark c, a bottom (or beautiful) quark b, and the top quark t, 
which appears to be the heaviest of them all. The reader is warned that there 
are other patterns in which quarks could have emerged, with very different 
consequences. One such scheme is that of Barnes, Jarvis, and Ketley [2] where 
instead of quarks, and possible partners, appearing as representing multiplets 
of a given SU(2), new SU(2)s appear with new sets of quarks and possible 
partners. After charm, the next one in this scheme is called style, stealing from 
a Frank Sinatra song with the snatches “You either have or you haven't got 
style, if you have it sticks out a mile” and “Style and charm kind of go arm 
in arm.” 
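
The normalization of the Gell-Mann matrices in Equation (8.12) is easy to confirm numerically. The following is a small self-contained sketch using the standard form of the eight matrices; the helper names are illustrative assumptions.

```python
import math

s = 1 / math.sqrt(3)

# The eight Gell-Mann matrices in their standard form.
GELL_MANN = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    [[s, 0, 0], [0, s, 0], [0, 0, -2 * s]],
]


def trace_of_product(a, b):
    """Tr(a b) for two 3 x 3 matrices given as nested lists."""
    return sum(a[i][k] * b[k][i] for i in range(3) for k in range(3))


def is_hermitian(m):
    """True if m equals its conjugate transpose."""
    return all(
        abs(m[i][j] - complex(m[j][i]).conjugate()) < 1e-12
        for i in range(3)
        for j in range(3)
    )


def trace(m):
    return sum(m[i][i] for i in range(3))


# Table of Tr(lambda_a lambda_b); it should equal 2 * delta_ab.
gram = [[trace_of_product(a, b) for b in GELL_MANN] for a in GELL_MANN]
print([round(abs(gram[a][a]), 12) for a in range(8)])
```

The diagonal entries of the table are all 2 and the off-diagonal entries vanish, which is the stated normalization.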

Returning to the usual notation, by the mid-1960s, it was understood that
the weak interaction processes of hadrons with each other and with leptons
are well described at low energy by the effective Lagrangian

$$\frac{G_F}{\sqrt{2}}\,J^\lambda\left[\bar{e}\gamma_\lambda(1 + \gamma_5)\nu_e + \bar\mu\gamma_\lambda(1 + \gamma_5)\nu_\mu\right] + \text{H.c.} \tag{8.13}$$

where $J^\lambda$ is a hadronic current. Within the quark model, the commutation
and conservation properties of $J^\lambda$ allowed it to be identified with the quark
current

$$J^\lambda = \bar{u}\gamma^\lambda(1 + \gamma_5)d\,\cos\theta_c + \bar{u}\gamma^\lambda(1 + \gamma_5)s\,\sin\theta_c . \tag{8.14}$$
Here $\theta_c$ is an angle known as the Cabibbo [3] angle. Experiments on nuclear
processes decaying from one state to another plus $e^\pm + \nu$, and meson states
decaying similarly, confirmed that $G_F$ has almost the same value as that
measured in the purely leptonic process $\mu^+ \to \bar\nu_\mu + e^+ + \nu_e$ and give for $\theta_c$ the value
$\sin\theta_c = 0.220 \pm 0.003$. The natural conclusion was that the quarks provide
another SU(2) x U(1) doublet with the form

$$\frac{1 + \gamma_5}{2}\begin{pmatrix} u \\ d\cos\theta_c + s\sin\theta_c \end{pmatrix} \tag{8.15}$$

together with right-handed singlets adjusted to give the quark charges
$\tfrac{2}{3}e$ and $-\tfrac{1}{3}e$. This has many problems, particularly yielding decay rates
for processes, such as $K^0 \to \mu^+ + \mu^-$, many orders of magnitude greater
than observed. Eventually this situation was clarified by Glashow, Iliopoulos,
and Maiani [4] who proposed that there was an extra term in $J^\lambda$ of the
form

$$\bar{c}\gamma^\lambda(1 + \gamma_5)\left[-d\sin\theta_c + s\cos\theta_c\right] \tag{8.16}$$


where c is a fourth quark with charge $\tfrac{2}{3}e$. The charged current may now be
written as

$$J^\lambda = (\bar{u}\cos\theta_c - \bar{c}\sin\theta_c)\gamma^\lambda(1 + \gamma_5)d + (\bar{u}\sin\theta_c + \bar{c}\cos\theta_c)\gamma^\lambda(1 + \gamma_5)s, \tag{8.17}$$

which became known as the GIM mechanism and attracted large wagers. The
main effect of this change was to suppress loop diagrams for $s + \bar{d} \to d + \bar{s}$,
bringing the rate for $K^0$-$\bar{K}^0$ oscillations into agreement with experiment.

It was later noted by Weinberg [5] that this solves the problem of the
strangeness changing $Z^0$ interactions. In the context of the SU(2) x U(1) gauge
theory the combination $-d_L\sin\theta_c + s_L\cos\theta_c$ cannot be a singlet but must
be part of another doublet

$$\frac{1 + \gamma_5}{2}\begin{pmatrix} c \\ -d\sin\theta_c + s\cos\theta_c \end{pmatrix} \tag{8.18}$$
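
The cancellation behind the GIM mechanism can be illustrated with a short numerical sketch. It is schematic: Dirac matrices and coupling constants are ignored, and only the coefficients of the d and s fields in the neutral bilinears are tracked. The variable and function names are assumptions made for this illustration.

```python
import math

theta_c = math.asin(0.220)            # Cabibbo angle from the quoted sin(theta_c)
cos_c, sin_c = math.cos(theta_c), math.sin(theta_c)

# Coefficients of (d, s) in the two rotated combinations.
d_rot = (cos_c, sin_c)                # partner of u:  d cos + s sin
s_rot = (-sin_c, cos_c)               # partner of c: -d sin + s cos


def neutral_bilinear(q):
    """Coefficient matrix of (q-bar q) in the (d, s) basis."""
    return [[q[i] * q[j] for j in range(2)] for i in range(2)]


def add(m1, m2):
    """Entry-wise sum of two 2 x 2 coefficient matrices."""
    return [[m1[i][j] + m2[i][j] for j in range(2)] for i in range(2)]


three_quark = neutral_bilinear(d_rot)                       # u, d, s only
four_quark = add(three_quark, neutral_bilinear(s_rot))      # with the c doublet

print("d-bar s coefficient without c:", three_quark[0][1])
print("d-bar s coefficient with c:   ", four_quark[0][1])
```

With only the first doublet the strangeness changing coefficient is $\cos\theta_c\sin\theta_c$; adding the second doublet cancels it and leaves the flavor diagonal terms with unit coefficient.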


Particles containing the c quark in a $c$-$\bar{c}$ bound state were discovered in
1974 by Aubert et al. [6] and by Augustin et al. [7] and indicated a mass
$m_c \approx 1.5$ GeV, which is not precise. This completed two generations of quarks
and leptons.

The first sign of a third generation was the discovery of a third charged
lepton by Perl et al. [8], the $\tau$. Then a fifth quark type, the b (beauty), was
discovered by Herb et al. [9], with charge $-\tfrac{1}{3}e$ and a mass of about 4.5 GeV.
A sixth quark type, the t (top) with charge $\tfrac{2}{3}e$, became theoretically
necessary. It was eventually discovered by Ellis et al. [10], giving a combined
value of the experimental results in the previous references of $181 \pm 12$ GeV
in 1995.
The real point of the Lagrangian formalism is that it provides a natural
framework for the quantum mechanical implementation of symmetry
principles. The dynamical equations in the Lagrangian formalism take the form
of a variational principle, the principle of stationary action. Consider any
infinitesimal transformation of the fields

$$\Psi^\ell(x) \to \Psi^\ell(x) + i\epsilon F^\ell(x) \tag{8.19}$$

that leaves the action invariant

$$0 = \delta I = i\epsilon\int d^4x\,\frac{\delta I[\Psi]}{\delta\Psi^\ell(x)}\,F^\ell(x). \tag{8.20}$$

If $\epsilon$ is a constant, such symmetries are known as global symmetries. Of course,
this is automatically satisfied for all infinitesimal variations of the fields if the
fields satisfy the dynamical equations. By an infinitesimal symmetry
transformation we mean one that leaves the action invariant even when the dynamical
equations are not satisfied. If we now consider the same transformation with
$\epsilon$ an arbitrary function of position in space-time,

$$\Psi^\ell(x) \to \Psi^\ell(x) + i\epsilon(x)F^\ell(x), \tag{8.21}$$

then, in general, the variation of the action will not vanish, but it will have to
be of the form

$$\delta I = -\int d^4x\, J^\mu(x)\,\frac{\partial\epsilon(x)}{\partial x^\mu} \tag{8.22}$$

in order that it should vanish when $\epsilon(x)$ is constant. If we now take the fields in
$I[\Psi]$ to satisfy the field equations then $I$ is stationary with respect to arbitrary
field variations that vanish at large space-time distances, including variations
of the form of Equation (8.21), so in this case Equation (8.22) should vanish.
Integrating by parts, we see that $J^\mu(x)$ must satisfy a conservation law

$$\frac{\partial J^\mu(x)}{\partial x^\mu} = 0. \tag{8.23}$$

It follows immediately that

$$\frac{dF}{dt} = 0 \tag{8.24}$$

where

$$F = \int d^3x\, J^0 . \tag{8.25}$$

There is one such conserved $J^\mu$ and one constant of the motion $F$ for each
independent infinitesimal symmetry transformation. This represents a
general feature of the canonical formalism, often referred to as Noether's theorem:
symmetries imply conservation laws. Einstein was so impressed that he arranged
for the German original to be translated into English and the final outcome
is given in Reference [11]. Note that the English version, being much later,
contains an outline of the applications that Noether's theorem has
subsequently found, especially in the calculus of variations and relativity theory.
Problems


8.1 Calculate the differential and total cross-sections for $e^+e^- \to \mu^+\mu^-$
in lowest order in e. The electron and muon spins are not observed.

8.2 Calculate the differential cross-section for electron-electron scatter- 
ing to lowest order in e. Assume that final and initial spins are not 
measured. 

8.3 Carefully defining your own notation, including normalization, de- 
rive the 3 x 3 matrix form of the Gell-Mann 4 matrices. 

8.4 State what you understand by the Cabibbo angle and give a rough 
numerical value for its sine. 

8.5 Explain how the Cabibbo angle entered into the standard model, 
and say why this natural idea had very real problems. 

8.6 Carefully explaining your notation, explain how Glashow, Iliopoulos,
and Maiani made a proposal to remove the difficulties mentioned
in Problem 8.5.

8.7 How did the Glashow, Iliopoulos, and Maiani, (GIM) mechanism 
first reveal its potential? 

8.8 Say in your own words what Weinberg noted about the GIM mech- 
anism, which revealed its possible potential. 
8.9 What else did Weinberg notice about the GIM mechanism and how
did he improve matters?


8.10 What did Perl et al. discover and what was its significance for model 
building? 

8.11 Say in your own words what the real point is of the Lagrangian 
formalism for model builders. 

8.12 What does your answer to Problem 8.11 lead to? 


8.13 What is the familiar way of describing the content of Noether's 
theorem? 


9 


Spontaneous Symmetry Breaking and the 
Unification of the Electromagnetic and Weak 
Forces 


The usual starting point for this topic is the wine bottle potential depicted in
Figure 9.1. Clearly, there is an unstable position of equilibrium for a ball under
"gravity" on top of the hump in the center. Once this is disturbed there is a
spontaneous breaking of the symmetry and one position on the horizontal
circle (shown dotted) is selected at random. Obviously we are really talking
about vacuum expectation values and states here. The dotted line around the lowest
point of the potential is a set of massless states of equal energy, which can be
transported with effectively zero force. However, the selected state has a mass,
as we see by the fact that it takes energy to move it up the wall. In practice the
"bottle" is represented by a quartic in scalar fields with a positive coefficient
for the fourth power terms but a negative coefficient for the quadratic terms.
So we can write

$$a(\phi^2)^2 - b\phi^2 \quad\text{with } a > 0 \text{ and } b > 0, \tag{9.1}$$

which has a minimum when

$$2a\phi^2 - b = 0, \tag{9.2}$$

$$\phi^2 = \frac{b}{2a}, \quad\text{really}\quad \langle\phi^2\rangle = \frac{b}{2a}, \tag{9.3}$$

where the lowest point is

$$a\left(\frac{b}{2a}\right)^2 - b\left(\frac{b}{2a}\right) = \frac{b^2}{4a} - \frac{b^2}{2a} = -\frac{b^2}{4a}, \tag{9.4}$$


which can be set to zero by putting b = (—)/2a. The potential is then 
al?) — J2aq? . (9.5) 
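
The location and depth of the minimum of the quartic can be checked numerically. The sketch below assumes the form $a(\phi^2)^2 - b\phi^2$ with $a > 0$ and a positive $b$, so that the quadratic coefficient $-b$ is negative; the numerical values of `a` and `b` are arbitrary illustrative choices.

```python
def potential(phi, a, b):
    """Quartic 'wine bottle' potential a*(phi^2)^2 - b*phi^2."""
    return a * phi ** 4 - b * phi ** 2


a, b = 0.5, 2.0                       # illustrative values with a > 0 and b > 0

# Analytic minimum: phi^2 = b / (2a), with value -b^2 / (4a).
phi_min = (b / (2 * a)) ** 0.5
v_min = potential(phi_min, a, b)

# Brute-force scan over 0 <= phi <= 3 as an independent check.
grid_min = min(potential(i / 1000, a, b) for i in range(0, 3001))

print(phi_min ** 2, v_min, grid_min)
```

The scan agrees with the analytic values, and the potential at $\phi = 0$ (the top of the hump) lies above the minimum.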


This is perhaps a good moment to mention the two volumes of The Quantum
Theory of Fields by Nobel Prize winner S. Weinberg [1, 2], which are a
meticulous and detailed presentation of the standard model of elementary particles
by one of its greatest exponents. Indeed, if one is interested in physics beyond
the standard model and the role of string theories in higher dimensions (which
is the only known way to correctly introduce Einstein's gravity) then
Weinberg's two following volumes [3, 4] on these topics cannot be recommended
too highly.

FIGURE 9.1
The wine bottle potential.

At any rate, we have adopted the notation of the first two volumes for this 
book, with the one exception that the parameters of group transformations 
carry opposite signs as a consequence of our choice of an active viewpoint. 

With the charge on the proton taken as e, we have the charge operator Q
related to the hypercharge Y by


| B+S-L 


where B is baryon number, S is strangeness, and L is lepton number. 

As a consequence, the up and down quarks that make up the charges of the
proton and neutron, appearing as they do in triplets as a result of their colors
being absorbed into singlets, have charges $\tfrac{2}{3}$ and $-\tfrac{1}{3}$, respectively. Similarly the
negatively charged electron $e^-$ and its associated neutral neutrino $\nu_e$ form a
doublet $\begin{pmatrix} \nu_e \\ e^- \end{pmatrix}$ of SU(2) x U(1). As far as we know there are two further
generations of this structure where the quarks are called charm and strange, then
top and beauty (or bottom) in the third generation. Similarly the lepton pairs,
the muon and its neutrino, then the tau and its neutrino, form doublets at
increasing masses of the charged particles in the doublets

$$\begin{pmatrix} \nu_e \\ e^- \end{pmatrix}, \quad \begin{pmatrix} \nu_\mu \\ \mu^- \end{pmatrix}, \quad \begin{pmatrix} \nu_\tau \\ \tau^- \end{pmatrix}$$

where the neutrinos were all thought to be massless. We say “were” because
recent experiments have shown that at least one neutrino must have a mass
that is not zero. The key experiments show oscillations between different
neutrino types. At any rate the doublets of charged leptons and their associated
neutral partners are left-handed ones projected by

$$\tfrac{1}{2}(1 + \gamma_5), \tag{9.8}$$

while the singlet positive right-handed charged partners of the charged
leptons (positron, $\mu^+$ and $\tau^+$) are projected by

$$\tfrac{1}{2}(1 - \gamma_5). \tag{9.9}$$


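Returning to the wine bottle potential, we introduce a doublet with Y = 1 made
from a positive and a neutral meson and denoted by $\begin{pmatrix} \phi^+ \\ \phi^0 \end{pmatrix}$. With the potential
written as

$$V = \mu^2\phi^\dagger\phi + \lambda(\phi^\dagger\phi)^2, \quad\text{with } \mu^2 < 0 \text{ and } \lambda > 0, \tag{9.10}$$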


the ¢ develops an expectation value 


«6s c I (D) vm = OF +00), (9.11 


as a result of the spontaneous symmetry breaking from SU(2) x U(1) to a U(1),
which is a combination of the obvious U(1) factor with the U(1) generated
in the SU(2) by $t_3$.

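Returning to the vector bosons (recall $A_\mu$ and $F_{\mu\nu}$) that carry the forces of
electromagnetism, and similarly the ones carrying the U(1), which we now
denote by $B_\mu$ and $F_{\mu\nu}$, we note that they must all be massless (to start with, at
least until the symmetry is spontaneously broken to U(1)) because there is no
invariant $A_\mu A^\mu$ (or $B_\mu B^\mu$) as the keen student can easily test. The Yang-Mills
[5] Lagrangian is

$$-\tfrac{1}{4}\left(F^i_{\mu\nu}F^{i\mu\nu}\right) - \tfrac{1}{4}\left(F_{\mu\nu}\right)^2
= -\tfrac{1}{4}\left(\partial_\mu A^i_\nu - \partial_\nu A^i_\mu + g\,\epsilon^{ijk}A^j_\mu A^k_\nu\right)^2 - \tfrac{1}{4}\left(\partial_\mu B_\nu - \partial_\nu B_\mu\right)^2, \tag{9.12}$$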
which follows from the definition 
Fi, 29, A, — 9, A, + f" Al A (9.13) 


by substituting f‘/* = —ige'/* and similarly 0 for the U(1) case. 
What we know is that after spontaneous symmetry breaking there is one
vector boson field of charge e with mass $m_W$ given by

$$W^\mu = \frac{1}{\sqrt{2}}\left(A_1^\mu + iA_2^\mu\right) \tag{9.14}$$

and another of charge $-e$ with mass $m_W$ given by

$$W^{\mu *} = \frac{1}{\sqrt{2}}\left(A_1^\mu - iA_2^\mu\right). \tag{9.15}$$

There are also two electrically neutral vector boson fields of masses $m_Z$ and 0,
respectively, given by orthonormal linear combinations of $A_3^\mu$ and $B^\mu$. They
are

$$Z^\mu = \cos(\theta)A_3^\mu + \sin(\theta)B^\mu \tag{9.16}$$

$$A^\mu = -\sin(\theta)A_3^\mu + \cos(\theta)B^\mu . \tag{9.17}$$

We have identified the $A^\mu$ as the massless photon field from knowing Q. This
mixing angle is usually known as the Weinberg angle.

To complete this picture we write down a Yukawa [6] coupling of the lepton
doublet to the scalars in the form

$$-G_e\,\overline{\begin{pmatrix} \nu_e \\ e \end{pmatrix}}_L\begin{pmatrix} \phi^+ \\ \phi^0 \end{pmatrix}e_R + \text{H.c.} \tag{9.18}$$

It is possible that there are other scalar multiplets in the theory (and in
extensions of the standard model, such as supersymmetry, they are compulsory),
but we will not consider them here.

Clearly we need a gauge invariant term involving scalar and gauge fields. 
The most general form consistent with SU (2) x U(1) gauge invariance, Lorentz 
invariance (and, although we do not treat this here, consistent with renormal- 
izability), is 


1 mE ; 
£4 = (-)5 (Ou - 1A,T (9) iB, e| 
2 
_ H pto- > (ergy 
59 b= (99, (9.19) 


where $\lambda > 0$. It is possible to perform an SU(2) x U(1) gauge transformation
to a unitary gauge, in which $\phi^+ = 0$ and $\phi^0$ is Hermitian, with a positive
vacuum expectation value $v$. The real part of $\phi^0$ is the only physical scalar field.
The scalar Lagrangian then yields a vector meson mass term of the form


(-) \frac{v^2 g^2}{4} W^+_\mu W^{-\mu} - \frac{v^2}{8} (g^2 + g'^2) Z_\mu Z^\mu. (9.20)


The photon mass term is zero as expected. The $W^\pm$ and the $Z^0$ have masses

m_W = \frac{v g}{2}, \qquad m_Z = \frac{v}{2} \sqrt{g^2 + g'^2} (9.21)




respectively. Also, in the lowest order the electron now has a mass

m_e = G_e v. (9.22)


The extension to include the other two generations of leptons is straightforward,
with the e and $\nu_e$ replaced by $\mu$ and $\nu_\mu$ and by $\tau$ and $\nu_\tau$, respectively. Also
replace $G_e$ by

G_\mu = G_e \left( \frac{m_\mu}{m_e} \right) and so forth. (9.23)


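In the case of the muon, the exchange of W between low energy e and $\mu$
leptons produces an effective interaction

\frac{g^2}{8 m_W^2} [\bar{e} \gamma^\lambda (1 + \gamma_5) \nu_e \, \bar{\nu}_\mu \gamma_\lambda (1 + \gamma_5) \mu] + h.c., (9.24)

which may be compared to effective V-A theory, which gives

\frac{G_F}{\sqrt{2}} [\bar{e} \gamma^\lambda (1 + \gamma_5) \nu_e \, \bar{\nu}_\mu \gamma_\lambda (1 + \gamma_5) \mu] + h.c., (9.25)

where $G_F$ is the conventional Fermi coupling constant, which is known from
the good description of muon decay to have the value

G_F = 1.16639(2) \times 10^{-5} \text{GeV}^{-2}. (9.26)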
Comparison yields

\frac{g^2}{m_W^2} = 4 \sqrt{2} G_F (9.27)

so that

v = \frac{2 m_W}{g} = \frac{1}{2^{1/4} G_F^{1/2}} = 247 \text{GeV}. (9.28)

We also see that

G_e = \frac{0.511 \text{MeV}}{247 \text{GeV}} = 2.07 \times 10^{-6}, (9.29)


which is a very small value. Notice that $m_Z > m_W$. In terms of the weak mixing
angle

m_W = \frac{e v}{2 |\sin(\theta)|} = \frac{37.3 \text{GeV}}{|\sin(\theta)|}, (9.30)

m_Z = \frac{e v}{2 |\sin(\theta)| |\cos(\theta)|} = \frac{74.6 \text{GeV}}{|\sin(2\theta)|}. (9.31)
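The numbers quoted in Equations (9.28) to (9.31) are easy to check for yourself. The following is only a minimal sketch: the function names are invented for illustration, and the value of the fine-structure constant (1/137.036, used to obtain e) is an assumption that does not appear in the text. With these inputs v comes out close to the 247 GeV quoted above.

```python
import math

G_FERMI = 1.16639e-5        # Fermi constant in GeV^-2, Equation (9.26)
ALPHA = 1.0 / 137.036       # assumed low-energy fine-structure constant
ELECTRON_MASS = 0.511e-3    # electron mass in GeV


def vacuum_expectation_value(g_fermi: float) -> float:
    """v = 1 / (2^(1/4) G_F^(1/2)), Equation (9.28), in GeV."""
    return (math.sqrt(2.0) * g_fermi) ** -0.5


def electric_charge(alpha: float) -> float:
    """e = sqrt(4 pi alpha) in natural units (assumed convention)."""
    return math.sqrt(4.0 * math.pi * alpha)


def w_mass(theta: float) -> float:
    """m_W = e v / (2 |sin(theta)|), Equation (9.30), in GeV."""
    v = vacuum_expectation_value(G_FERMI)
    return electric_charge(ALPHA) * v / (2.0 * abs(math.sin(theta)))


def z_mass(theta: float) -> float:
    """m_Z = e v / |sin(2 theta)|, Equation (9.31), in GeV."""
    v = vacuum_expectation_value(G_FERMI)
    return electric_charge(ALPHA) * v / abs(math.sin(2.0 * theta))


def electron_yukawa() -> float:
    """G_e = m_e / v, from Equation (9.22)."""
    return ELECTRON_MASS / vacuum_expectation_value(G_FERMI)


if __name__ == "__main__":
    print("v   =", vacuum_expectation_value(G_FERMI), "GeV")
    print("G_e =", electron_yukawa())
    # The coefficients 37.3 GeV and 74.6 GeV appear at sin(theta) = 1
    # and sin(2 theta) = 1, respectively.
    print("m_W |sin(theta)|   =", w_mass(math.pi / 2), "GeV")
    print("m_Z |sin(2 theta)| =", z_mass(math.pi / 4), "GeV")
```

Because $|\sin(2\theta)| \le |\sin(\theta)|$ can fail only when $|\cos(\theta)| > 1/2$ is violated, it is worth trying a few angles to convince yourself that $m_Z > m_W$ always holds in Equations (9.30) and (9.31).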


There are many corrections, of course. In particular there is a very large
radiative correction. Also we know that the coupling constants change with energy
scale. The best values are possibly

m_W = \frac{38.4 \text{GeV}}{|\sin(\theta)|} and m_Z = \frac{76.9 \text{GeV}}{|\sin(2\theta)|}. (9.32)


It is now clear that, whatever the value of the Weinberg angle, these masses
were too large to allow W or Z to be found before the early 1970s. Neutral
current processes produced by $Z^0$ exchange gave perhaps the earliest
confirmation of the theory in 1973 in a bubble chamber study of $\nu_\mu e^-$ elastic scattering.
(As an interesting side note, the Nobel Prize shared by three authors [7] was
awarded before the discovery of the $W^\pm$ and $Z^0$ and despite strong advice to
the contrary.)
Moving on, the quark current

J^\mu = \bar{u} \gamma^\mu (1 + \gamma_5) d \cos(\theta_c) + \bar{u} \gamma^\mu (1 + \gamma_5) s \sin(\theta_c), (9.33)

where $\theta_c$ is the Cabibbo [8] angle, had already been used in the 1960s to
understand weak interactions between leptons and hadrons by the low-energy
effective Lagrangian


\frac{G_F}{\sqrt{2}} [\bar{e} \gamma_\mu (1 + \gamma_5) \nu_e + \bar{\mu} \gamma_\mu (1 + \gamma_5) \nu_\mu] J^\mu + h.c. (9.34)


Experiments on processes such as $O^{14} \to N^{14*} + e^+ + \nu_e$ showed that $G_F$
had much the usual value and that $\sin(\theta_c) = 0.220 \pm 0.003$. This led to other
mixings and eventually to the hadronic current


J^\mu = (\bar{u} \;\; \bar{c} \;\; \bar{t}) \, \gamma^\mu (1 + \gamma_5) V \begin{pmatrix} d \\ s \\ b \end{pmatrix} (9.35)

between all three generations of quarks, where V is a 3 x 3 unitary matrix
named after Kobayashi and Maskawa [9]. Much interest in research today
centers on this matrix, and the existence of three families allows a complex
phase in the matrix and thus T and CP violation. One of the best tests of
this part of the standard model is the annihilation of $e^+ e^-$ in colliding beams.
This reveals resonances (of then unexpected narrowness) and threshold steps
to plateaux of calculable heights.
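The remark that three families allow a complex phase in the mixing matrix rests on a standard parameter count, which is not spelled out in the text. A minimal sketch follows; the function name is invented, and the key assumption is that each of the 2n quark fields may be rephased freely, with one overall phase having no effect on the matrix, so that 2n - 1 phases can be removed.

```python
def mixing_matrix_parameters(n_generations: int) -> tuple[int, int]:
    """Return (rotation angles, physical phases) of an n x n unitary mixing matrix.

    Counting argument (assumed, not taken from the text):
      * an n x n unitary matrix has n^2 real parameters;
      * n(n - 1)/2 of them are rotation angles, as for a real orthogonal matrix;
      * the remaining n(n + 1)/2 are phases;
      * rephasing the 2n quark fields removes 2n - 1 of those phases.
    """
    n = n_generations
    total = n * n
    angles = n * (n - 1) // 2
    phases = total - angles
    removable = 2 * n - 1
    return angles, phases - removable


if __name__ == "__main__":
    for n in (2, 3, 4):
        angles, phases = mixing_matrix_parameters(n)
        print(f"{n} generations: {angles} angle(s), {phases} physical phase(s)")
```

For two generations this leaves the single Cabibbo angle and no phase; for three generations it leaves three angles and one phase.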
Problems


9.1 Go through the argument from Equation (9.1) to (9.5) and draw
your own wine bottle potential.


9.2 Using Equation (9.6) work out your own charges for up and down
quarks and for the charges of the electron and its associated
neutrino.


9.3 Using the Weinberg angle, work out your own versions of $Z^\mu$
and $A^\mu$.


9.4 Work out your own versions of $m_W$ and $m_Z$.


9.5 Show that a 2 x 2 unitary matrix has no complex phase but a 3 x 3
unitary matrix does have one complex phase.
The Goldstone Theorem and the Consequent
Emergence of Nonlinearly Transforming 
Massless Goldstone Bosons 


At the deepest level this chapter and Chapter 13 could be viewed as an intro- 
duction to the work of Einstein on general relativity where he used curvature 
of the three spatial and one time dimensions of the world in which we live to 
describe the theory of gravity. Any reader fortunate enough to have a strong 
background in general relativity can regard this book as light bedtime read- 
ing. I had such a fortunate background from my undergraduate days reading 
special mathematics at King's College London, then arguably the leading 
place in the world for gravitational research. My meeting with quantum me- 
chanics terminated my possible research in the gravitational area with my 
heroes F. Pirani and H. Bondi. (My deep gratitude is to Pirani for helping me 
to change course to work with P. Matthews and A. Salam at Imperial College 
London.) At any rate, the idea is to use $S_2$, the two-dimensional surface of the
points equidistant from the origin in three spatial dimensions, as the simplest 
nontrivial curved space as an introductory example, which we can all easily 
visualize, but to handle the mathematical details by the equivalent machinery 
to that used in more dimensions. 

In the United Kingdom, most serious general relativity research is done in 
mathematics departments, and even when a physics or astronomy depart- 
ment has a serious research interest, the subject is rarely covered at under- 
graduate 3 or 4 levels. At Southampton University, where there is a strong 
general relativity group and even an appropriate book written in house, no 
astronomy or astrophysics students are taught general relativity nor even 
encouraged (forced?) to attend the one course in the mathematics 4th un- 
dergraduate year. Only rarely do dedicated theoretical high energy physics 
students take the course in their first year of research. The situation is often 
far worse at other institutions. The hope is that this chapter and Chapter 13 
may act as a stepping stone for at least some of those students. 

What exactly are Goldstone bosons? Whenever a simple Lie group is broken
spontaneously (smoothly and in an arbitrary "direction") then a theorem by J.
Goldstone [1] tells us that massless Goldstone bosons (scalar or pseudoscalar)
are produced carrying the quantum numbers of the generators of the Lie group
along the broken "directions." We shall see later how these bosons interact
with each other, why they are massless, and how they interact with other
(standard field) states of the system. For the moment, we shall concentrate
on the breaking of $SU_2$ to $U_1$. Here the $SU_2$ can be thought of as generated by
rotations about the three spatial directions, and broken to leave simply the
one generated by rotations about the third axis. Thus, the Goldstone bosons
are associated with the first two directions. If we call these bosons $\phi_1$ and $\phi_2$,
they can be thought of as interpolating fields for the two Goldstone bosons
(subject to appropriate boundary conditions we shall meet later) with the
transformations between them under rotations (we take the active point of
view) as corresponding changes of the fields. Alternatively, we can consider
these $\phi_A$, A = 1, 2, to be coordinates on the two-dimensional surface of the
sphere and again (subject to conditions) to carry the information about the
transformation of the points on the sphere.

With an eye to future developments, we can regard the surface of the
sphere as a coset space manifold of dimension two. If we introduce Pauli
matrices by

\sigma_i with i = (1, 2, 3) (10.1)

then the $SU_2$ generated can be represented by

Q_i = \exp\left(-\frac{i}{2} \sigma_i \xi_i\right) (10.2)

where the $\xi_i$ can be thought of as angles of rotation about the respective axes
x, y, z or (1, 2, 3), where $\xi^2 = \xi_i \xi_i$ is an invariant, with the consequence that
we find the commutators of the generators are

[Q_i, Q_j] = i \epsilon_{ijk} Q_k (10.3)

where summation is implied over the repeated index, and the Levi-Civita
totally antisymmetric tensor is specified by

\epsilon_{ijk} = 1 if (ijk) are cyclic on (1, 2, 3)
= -1 if (ijk) are anticyclic on (1, 2, 3)
= 0 if any pair of (ijk) are equal. (10.4)

Here we have used

\sigma_i \sigma_j = \delta_{ij} 1 + i \epsilon_{ijk} \sigma_k (10.5)

where 1 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}. (10.6)
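The product rule for the Pauli matrices and the resulting commutators can be checked by brute force. The sketch below assumes the usual explicit form of the three Pauli matrices and takes the generators to be $\sigma_i / 2$, which is my reading of the intended normalization; the function names are invented for illustration.

```python
import itertools

import numpy as np

# The three Pauli matrices in their usual explicit form.
SIGMA = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
IDENTITY = np.eye(2, dtype=complex)


def levi_civita(i: int, j: int, k: int) -> int:
    """Totally antisymmetric tensor for indices 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) // 2


def product_identity_holds() -> bool:
    """Check sigma_i sigma_j = delta_ij 1 + i eps_ijk sigma_k for all i, j."""
    for i, j in itertools.product(range(3), repeat=2):
        rhs = (i == j) * IDENTITY + 1j * sum(
            levi_civita(i, j, k) * SIGMA[k] for k in range(3)
        )
        if not np.allclose(SIGMA[i] @ SIGMA[j], rhs):
            return False
    return True


def generator_algebra_holds() -> bool:
    """Check [T_i, T_j] = i eps_ijk T_k for T_i = sigma_i / 2 (assumed normalization)."""
    t = [s / 2 for s in SIGMA]
    for i, j in itertools.product(range(3), repeat=2):
        commutator = t[i] @ t[j] - t[j] @ t[i]
        rhs = 1j * sum(levi_civita(i, j, k) * t[k] for k in range(3))
        if not np.allclose(commutator, rhs):
            return False
    return True


if __name__ == "__main__":
    print(product_identity_holds(), generator_algebra_holds())
```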
The keen students are strongly advised to become familiar with the index
notation and the Pauli matrices if they are not already competent in their use. 

If we designate the coset space represented by the two-dimensional surface
of the sphere, $S_2$, by

L = \exp\left(-\frac{i}{2} \xi_A \sigma^A\right) (10.7)

where A = (1, 2), then on left multiplication by a member of the $U_1$ subgroup
with parametrization $h_1$ we get

h_1 L = (h_1 L h_1^{-1}) h_1 (10.8)
= L' h_1 (10.9)

and using

L'(\xi) = L(\xi') (10.10)

reveals that

L(\xi') = h_1 L(\xi) h_1^{-1} (10.11)

so that the action of the $U_1$ subgroup on the field in the coset space is simply
that of rotation through the angle parametrizing the subgroup. Note that the
completeness relation on Pauli matrices is simply

(\sigma_i)_{AB} (\sigma_i)_{CD} = 2 \delta_{AD} \delta_{BC} - \delta_{CD} \delta_{AB} (10.12)

when thought of from the matrix point of view.
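Both statements above, the completeness relation and the claim that the $U_1$ subgroup simply rotates the coset coordinates, can be tested numerically. This is a sketch under stated assumptions: the subgroup element is taken to be $h_1 = \exp(-\frac{i}{2} \alpha \sigma_3)$, the closed form $\exp(-\frac{i}{2} \xi \cdot \sigma) = \cos(|\xi|/2) - i \sin(|\xi|/2) \, \hat{\xi} \cdot \sigma$ is used for the exponential, and all function names are invented.

```python
import numpy as np

# Pauli matrices stacked into one array of shape (3, 2, 2).
SIGMA = np.array(
    [
        [[0, 1], [1, 0]],
        [[0, -1j], [1j, 0]],
        [[1, 0], [0, -1]],
    ],
    dtype=complex,
)


def completeness_holds() -> bool:
    """Check (sigma_i)_AB (sigma_i)_CD = 2 d_AD d_BC - d_AB d_CD, summed over i."""
    delta = np.eye(2)
    lhs = np.einsum("iab,icd->abcd", SIGMA, SIGMA)
    rhs = 2 * np.einsum("ad,bc->abcd", delta, delta) - np.einsum(
        "ab,cd->abcd", delta, delta
    )
    return bool(np.allclose(lhs, rhs))


def su2_element(xi) -> np.ndarray:
    """exp(-(i/2) xi . sigma), evaluated with the closed form for SU(2)."""
    xi = np.asarray(xi, dtype=float)
    norm = np.linalg.norm(xi)
    if norm == 0.0:
        return np.eye(2, dtype=complex)
    n_dot_sigma = np.einsum("i,iab->ab", xi / norm, SIGMA)
    return np.cos(norm / 2) * np.eye(2) - 1j * np.sin(norm / 2) * n_dot_sigma


def rotated_coset_coordinates(xi1: float, xi2: float, alpha: float):
    """Rotate (xi1, xi2) through the angle alpha about the third axis."""
    c, s = np.cos(alpha), np.sin(alpha)
    return c * xi1 - s * xi2, s * xi1 + c * xi2


def u1_acts_as_rotation(xi1: float, xi2: float, alpha: float) -> bool:
    """Check h L(xi) h^{-1} = L(xi') with xi' the rotated coordinates."""
    h = su2_element([0.0, 0.0, alpha])
    coset = su2_element([xi1, xi2, 0.0])
    new1, new2 = rotated_coset_coordinates(xi1, xi2, alpha)
    return bool(np.allclose(h @ coset @ h.conj().T, su2_element([new1, new2, 0.0])))


if __name__ == "__main__":
    print(completeness_holds(), u1_acts_as_rotation(0.4, -0.9, 1.3))
```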
Generalizing our $S_2$ sphere to a more general coset space, we can write

gL = L'h (10.13)

where g is the group (previously $SU_2$) and h is the subgroup (previously $U_1$),
and L' is now a generalized version of L. Clearly the transformations
are nonlinear.

At this stage it is convenient to introduce the concept of the centralizer of
a subgroup within a group. Here this would be written as $C_h(g)$. It simply
means the collection of elements in g that commute with h, that is, whose
commutator with h is zero. In the $S_2$ case we had

C_{U_1}(SU_2) = U_1 (10.14)


Now there is a very useful theorem by A. Borel [2] to the effect that when
the centralizer is toroidal (a product of powers of $U_1$ factors) the coset space
manifold is a Kähler manifold of order given by the sum of the powers of
the $U_1$ factors. In the $S_2$ case, the order is just 1. Such a manifold can support
supersymmetry of the same order as the order of the manifold. What does
this mean and what are the implications? In the first place think about an
ordinary two-dimensional plane with coordinates x and y. If you like, you
can combine the coordinates into a complex variable, usually called z, by

z = x + iy (10.15)

with conjugate $\bar{z}$ (or $z^*$) given by

\bar{z} = x - iy (10.16)

and the question which at once arises is: Does the definition of the derivative
of a single real variable apply to functions of a complex variable? The
natural answer is that if f(z) is a one-valued function, defined in a region of
the Argand diagram, then f(z) is differentiable at a point $z_0$ of that region
if $[f(z) - f(z_0)] / (z - z_0)$ tends to a unique limit as $z \to z_0$, provided z is also a point
of that region. This limit is called the derivative of f(z) at $z = z_0$ and is denoted
by $f'(z_0)$.

A function of z which is one-valued and differentiable at every point of a 
domain D is said to be holomorphic (sometimes equivalently regular or analytic) 
in the domain D. 

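There is a necessary set of conditions for f(z) to be holomorphic. If f(z) =
u(x, y) + iv(x, y) is differentiable at a given point z, the ratio

\frac{f(z + \Delta z) - f(z)}{\Delta z}

must tend to a definite limit as $\Delta z \to 0$ in any manner. Now $\Delta z = \Delta x + i \Delta y$.
Take $\Delta z$ to be real, so that $\Delta y = 0$; then

\frac{u(x + \Delta x, y) - u(x, y)}{\Delta x} + i \left[ \frac{v(x + \Delta x, y) - v(x, y)}{\Delta x} \right]

must tend to a definite limit as $\Delta x \to 0$. Clearly the partial derivatives $\partial u / \partial x$ and
$\partial v / \partial x$ must exist at the point (x, y) and the limit is $\partial u / \partial x + i \, \partial v / \partial x$. Similarly, if we take
$\Delta z$ to be purely imaginary, so that $\Delta x = 0$, then $\partial u / \partial y$ and $\partial v / \partial y$ must exist at the
point (x, y) and the limit is $\partial v / \partial y - i \, \partial u / \partial y$.
Since the two limits must be identical, on equating real and imaginary parts
we find

\frac{\partial u}{\partial x} = \frac{\partial v}{\partial y}, \qquad \frac{\partial u}{\partial y} = (-) \frac{\partial v}{\partial x}. (10.17)

These two conditions are called the Cauchy-Riemann differential equations.
Obviously, assuming differentiability is much stronger than assuming continuity.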
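A quick numerical experiment makes the Cauchy-Riemann conditions concrete. The sketch below estimates the four partial derivatives by central differences and tests Equation (10.17) at a point; the step size, tolerance, and function names are arbitrary choices made for illustration. The function $z^2$ passes, while the conjugate $\bar{z}$, which is perfectly continuous, fails.

```python
def partial_derivatives(f, x: float, y: float, step: float = 1e-6):
    """Central-difference estimates of (u_x, u_y, v_x, v_y) for f = u + i v."""
    df_dx = (f(complex(x + step, y)) - f(complex(x - step, y))) / (2 * step)
    df_dy = (f(complex(x, y + step)) - f(complex(x, y - step))) / (2 * step)
    return df_dx.real, df_dy.real, df_dx.imag, df_dy.imag


def satisfies_cauchy_riemann(f, x: float, y: float, tol: float = 1e-5) -> bool:
    """Test u_x = v_y and u_y = -v_x at the point (x, y)."""
    u_x, u_y, v_x, v_y = partial_derivatives(f, x, y)
    return abs(u_x - v_y) < tol and abs(u_y + v_x) < tol


def square(z: complex) -> complex:
    return z * z


def conjugate(z: complex) -> complex:
    return z.conjugate()


if __name__ == "__main__":
    print("z^2  :", satisfies_cauchy_riemann(square, 0.3, -0.7))
    print("zbar :", satisfies_cauchy_riemann(conjugate, 0.3, -0.7))
```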

The sufficient conditions for f(z) to be holomorphic are as follows: the continuous
one-valued function f(z) is holomorphic in a domain D if the four partial
derivatives $\partial u / \partial x$, $\partial v / \partial x$, $\partial u / \partial y$, and $\partial v / \partial y$ are continuous and satisfy the
Cauchy-Riemann equations at each point of D. Coordinates in the manifold are the Kähler
coordinates. We shall stop this treatment here. Possibly we have already done
too much. But this idea of holomorphy plays a huge role in many subjects.
Problems


10.1 In your own words state what you understand about Goldstone
bosons.


10.2 In your own words state what you understand regarding the surface
of a sphere as a coset space of dimension two.


10.3 Define the Kronecker delta, $\delta_{ij}$, and the Levi-Civita tensor, $\epsilon_{ijk}$.


10.4 Define the Pauli matrices $\sigma_i$ with i = (1, 2, 3) and the associated
unit matrix 1.


10.5 Show that $\sigma_i \sigma_j = \delta_{ij} 1 + i \epsilon_{ijk} \sigma_k$.


10.6 Show how SU(2) can be generated by a simple function of the Pauli
matrices.


10.7 If $Q_i = \exp(-\frac{i}{2} \sigma_i \xi_i)$, calculate the commutator $[Q_i, Q_j]$.


10.8 State what you understand about the completeness relation and
work this out for the case of the Pauli matrices.


10.9 If we designate the coset space represented by the two-dimensional
surface of the sphere $S_2$ by $L = \exp(-\frac{i}{2} \xi_A \sigma^A)$ where A = (1, 2),
show that on left multiplication by a member of the $U_1$ subgroup
with parameterization $h_1$ we get $h_1 L = L' h_1$ and find L'.


10.10 Using $L(\xi') = L'(\xi)$ find the action of the $U_1$ subgroup on the coset
space and give your interpretation of this action.


10.11 Generalizing our $S_2$ sphere to a more general coset space, gL = L'h
where g is the group, h is the subgroup, and L' is a generalized
version of L. Do you think that the transformations are linear or
nonlinear?


10.12 What is understood by the centralizer of a subgroup in a group,
usually written as $C_h(g)$?


10.13 What does the Borel theorem tell us?


10.14 Combining the two-dimensional coordinates of a plane denoted by
x and y into a single complex variable z = x + iy, state what you
mean by the derivative of a function f(z) at $z_0$.


10.15 State what you mean by a holomorphic function of z.


10.16 If f(z) = u(x, y) + iv(x, y) show that the Cauchy-Riemann
differential equations

\frac{\partial u}{\partial x} = \frac{\partial v}{\partial y}, \qquad \frac{\partial u}{\partial y} = (-) \frac{\partial v}{\partial x}

must hold. Is this differentiability weaker or stronger than
continuity?


10.17 What are sufficient conditions for f(z) to be holomorphic? What are
coordinates in the manifold to be known as?
The Higgs Mechanism and the Emergence of
Mass from Spontaneously Broken Symmetries 


There are actually several situations involving one, or more in the case of the 
supersymmetric version, Higgs boson. These can be presented in a variety of 
ways and in different gauges. We shall present one of these to make the basic 
idea as simple as possible. It is the first one studied by Peter Higgs [1] himself. 
We already know what happens when a gauge theory, such as the Goldstone 
model [2], is spontaneously broken. Massless scalars or pseudoscalars appear 
as Goldstone bosons corresponding to the broken generators. Then the mass- 
less gauge vector or pseudovector bosons absorb the Goldstone bosons and 
the result is that they develop masses. So now Higgs started by making the 
Goldstone model locally U(1) invariant in the, by now, familiar way. We note 
from the start that we begin with four degrees of freedom, one each from the 
two scalars and two from the massless electromagnetic field potential $A^\mu(x)$
corresponding to the transverse waves of electromagnetism or equivalently 
the independent senses of polarization of the photons. To fully exhibit the par- 
ticle interpretation of this theory, it is best to change to a “radial and phase” 
field description. Then we expect it to be simple when the symmetry becomes 
local, because it is exactly local phase variations that are being considered. In 
fact, the theory, now with $A^\mu$ present, is invariant under local phase changes.
Indeed we can get rid of the phase field altogether. We start from 


\phi = \frac{1}{\sqrt{2}} [f + \rho(x)] \exp\left[ \frac{-i \theta(x)}{f} \right] (11.1)

where $f \lambda = \mu$. Under a local phase (or gauge) transformation

\phi \to \exp[-i e \chi(x)] \phi(x), (11.2)
A^\mu \to A^\mu + \partial^\mu \chi(x) (11.3)

and we see that

\rho' = \rho (11.4)
\theta' = \theta + e f \chi. (11.5)

If we pick the particular gauge function $\chi$ by

\chi = (-) \frac{1}{e f} \theta, (11.6)
we discover that $\theta' = 0$. So the phase of the gauge transformed $\phi$ can be made
zero. But this is a locally varying phase, or a phase field, and its quanta are the
massless Goldstone bosons of the spontaneously broken U(1) symmetry. It
seems that we can always pick a gauge $\chi$ such that at all points the phase field
is zero. So there are no Goldstone bosons! The massless scalar or pseudoscalar
particles have disappeared from the particle spectrum. But they correspond
to the original scalar, or isoscalar, degrees of freedom. Consider what the
original $A^\mu$ and $\phi$ have been replaced by now. We find that


A'^\mu = A^\mu - \frac{1}{e f} \partial^\mu \theta, (11.7)
\phi' = \frac{1}{\sqrt{2}} [f + \rho(x)]. (11.8)
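The disappearance of the phase field can be seen directly by putting numbers into Equations (11.1), (11.2), and (11.6). The following sketch samples arbitrary field values at a few points, applies the gauge transformation with $\chi = -\theta / (e f)$, and confirms that the transformed field is real and equal to $[f + \rho] / \sqrt{2}$. The numerical values of e and f and the function names are illustrative assumptions only.

```python
import numpy as np


def scalar_field(f: float, rho: np.ndarray, theta: np.ndarray) -> np.ndarray:
    """phi = (f + rho) exp(-i theta / f) / sqrt(2), Equation (11.1)."""
    return (f + rho) * np.exp(-1j * theta / f) / np.sqrt(2.0)


def gauge_transform(phi: np.ndarray, e: float, chi: np.ndarray) -> np.ndarray:
    """phi -> exp(-i e chi) phi, Equation (11.2)."""
    return np.exp(-1j * e * chi) * phi


def unitary_gauge_function(e: float, f: float, theta: np.ndarray) -> np.ndarray:
    """chi = -theta / (e f), Equation (11.6)."""
    return -theta / (e * f)


if __name__ == "__main__":
    e, f = 0.3, 2.0                       # illustrative values only
    rng = np.random.default_rng(0)
    rho = rng.normal(scale=0.1, size=5)   # radial field at five sample points
    theta = rng.normal(size=5)            # phase field at the same points

    phi = scalar_field(f, rho, theta)
    phi_prime = gauge_transform(phi, e, unitary_gauge_function(e, f, theta))

    print("largest imaginary part:", np.abs(phi_prime.imag).max())
    print("matches (f + rho)/sqrt(2):", np.allclose(phi_prime, (f + rho) / np.sqrt(2.0)))
```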


Consider the Euler-Lagrange equations. We find that for $A'^\mu$ we get

[\Box + e^2 f^2] A'^\mu - \partial^\mu \partial_\nu A'^\nu = (-) e^2 A'^\mu (\rho^2 + 2 f \rho). (11.9)

In the weak coupling, or perturbative, limit we can ignore the electromagnetic
current on the right-hand side of this equation. The left-hand side is the free
particle wave equation for a massive spin one particle. The operator $\Box$ is
replaced by $\Box + M^2$, where

M = e f. (11.10)


A massive spin one particle has two transverse degrees of freedom together 
with one longitudinal, making three degrees of freedom. So we see what 
happened to the Goldstone degrees of freedom. They got "eaten" by the 
gauge field $A^\mu$ to make the massive gauge field

A'^\mu = A^\mu - \frac{1}{M} \partial^\mu \theta. (11.11)

The $\theta$ field is precisely the one whose quanta were Goldstone particles. This
is the well known Higgs mechanism. In a sense, the two “masslessness” have 
canceled each other out. The result is that when a local U(1) invariance is 
spontaneously broken there are no Goldstone particles but the massless U(1) 
gauge field gains mass. There can be more than one Higgs boson; indeed 
with supersymmetric formulation there must be at least two. They can also 
be viewed in different gauges but they all come down eventually to what you 
have seen. Not only has supersymmetry not been found experimentally but 
neither has any Higgs boson. Notice that the Higgs boson does not specify 
its own mass. I am one of a growing group of theoreticians who believe that
the Higgs boson will never be found in the sense it is understood here. I
personally have started to look for an alternative structure but if I find one I
shall still call it the Higgs boson.
Problems


11.1 Read through from Equation (11.1) to Equation (11.6). Now close 
the book and repeat the calculations in your own notation if you 
prefer. 


11.2 Read through from Equation (11.7) to Equation (11.10). Now close 
the book and repeat the calculations in your own notation if you 
prefer. 


11.3 Explain in your own words how to count the degrees of freedom
after Equation (11.10).


11.4 Explain in your own words what is usually understood by the Higgs
mechanism.
Lie Group Techniques for beyond
the Standard Model Lie Groups 


We may well start with technicolor, a scheme proposed by L. Susskind [1], 
which was based on a generalization of the color group SU(3) of the strong 
interactions. It was briefly fashionable but was completely ruled out by exper- 
imental results and has now been forgotten by most serious researchers. The 
grand unified groups (or GUTs as they are often called) are the main source of 
possibilities. This starts historically with a Pati and Salam scheme [2]. Salam 
himself was the first to point out the two main problems. In all such schemes 
the proton becomes unstable and there is no sign of this experimentally. Also, 
all such theories must contain a magnetic monopole [3,4] and again there is 
no credible experimental candidate. By far the most fashionable candidate 
was the proposal by H. Georgi and S.L. Glashow [5] to use SU(5), which con- 
tains SU(3) x SU(2) x U(1) of the standard model and has a five-dimensional 
multiplet containing: 


(3, 1)_{-1/3} + (1, 2)_{1/2} (12.1)


where the first factor in the brackets is the SU(3) multiplicity, the second factor 
is the SU(2) multiplicity, and the subscript is the U(1) number normalized to 
give zero for this 5 representation of SU(5). The various techniques we studied 
for the standard model Lie groups apply directly, including the classification 
theorem by Dynkin [6]. 

As we move on, an important constraint is that the groups we consider must 
contain the standard model groups. There are only four classes of simple 
groups, SU(2n), SU(2n — 1), SO(2n), and SO(2n + 1) where n is a positive 
integer. The symplectic groups Sp(2n) seem never to have played a part. 

The orthogonal groups are interesting because, like the unitary groups,
they can have complex representations. As we shall see, SU(5) can be easily
embedded in an orthogonal group. The group SO(10) [8], which has rank 5,
contains the subgroup SU(5) x U(1) in two different ways. A spinor
representation of SO(10) is 16 dimensional and contains the $10 + \bar{5} + 1$ under SU(5).
Hence, it neatly contains a single fermion family, plus a right-handed neutrino
state. Higher dimensional orthogonal groups contain several such families in
a single irreducible representation. For $n \neq 6$, orthogonal groups are anomaly
free and thus explain the mystery of cancellations of the anomalies in the $\bar{5}$
and 10 representations. This is all so very convenient that it is hard to see
grand unification stopping at SU(5).

We turn next to SO(10), which is of rank 5. There are 45 gauge bosons
transforming as the adjoint representation; under SU(5) x U(1), the 45 of
SO(10) has the decomposition

45 = 24_0 + 1_0 + 10_1 + \overline{10}_{-1}, (12.2)


whereas the spinorial representations are $2^4 = 16$ dimensional. These are
written as the product of five SU(2) spinors. Now we wish to identify the
fermion states. This is easily done by choosing the SU(3) to be a subgroup
of SO(6) acting on the first three operator components and the SU(2) to be a
subgroup of the SO(4) acting on the last two spinor indices. Clearly

SO(10) \supset SO(6) x SO(4) (12.3)

so the product structure of the subgroups SU(3) x SU(2) is ensured [8].
By construction, the SU(5) gauge bosons couple to fermions, through the
decomposition of

SO(10) \supset SO(6) x SO(4) (12.4)
= SU(4) x SU(2) x SU(2) (12.5)
\supset SU(3)_{color} x U(1) x SU(2)_L x SU(2)_R. (12.6)


Under $SU(3)_{color}$ x U(1) x $SU(2)_L$ x $SU(2)_R$ the fermions in the 16
transform as

16 = (3, 2, 1) + (1, 2, 1) + (\bar{3}, 1, 2) + (1, 1, 2) (12.7)

and the 45 gauge bosons transform as

45 = (8, 1, 1) + (1, 3, 1) + (1, 1, 3) + (1, 1, 1)
+ (3, 2, 2) + (\bar{3}, 2, 2) + (3, 1, 1) + (\bar{3}, 1, 1). (12.8)


It should be noted that only one of the two embeddings of the standard model 
SU(3) x SU(2) x U(1) can be achieved as the spinor states in the other one 
are never singlets under SU(2). 
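A useful habit with decompositions of this kind is to check that the dimensions add up. The sketch below does only that bookkeeping; it does not verify the U(1) numbers. The function names are invented, and the entries for the 16 are the usual content $(3, 2, 1) + (1, 2, 1) + (\bar{3}, 1, 2) + (1, 1, 2)$, which I take as an assumption about what is intended in Equation (12.7). Barred multiplets have the same dimension as unbarred ones, so the bars play no role here.

```python
from math import prod


def dim_so_adjoint(n: int) -> int:
    """Number of generators of SO(n): n(n - 1)/2."""
    return n * (n - 1) // 2


def dim_su_adjoint(n: int) -> int:
    """Number of generators of SU(n): n^2 - 1."""
    return n * n - 1


def dim_so_chiral_spinor(n: int) -> int:
    """Dimension of one chiral spinor of SO(n) for even n: 2^(n/2 - 1)."""
    if n % 2:
        raise ValueError("chiral spinors are defined here for even n only")
    return 2 ** (n // 2 - 1)


def total_dimension(multiplets) -> int:
    """Sum of the products of the multiplicities in each bracket."""
    return sum(prod(m) for m in multiplets)


# Under SU(5): 45 = 24 + 1 + 10 + 10bar and 16 = 10 + 5bar + 1.
ADJOINT_UNDER_SU5 = [(24,), (1,), (10,), (10,)]
SPINOR_UNDER_SU5 = [(10,), (5,), (1,)]

# Under SU(3) x SU(2) x SU(2), Equations (12.7) and (12.8).
SPINOR_UNDER_SU3_SU2_SU2 = [(3, 2, 1), (1, 2, 1), (3, 1, 2), (1, 1, 2)]
ADJOINT_UNDER_SU3_SU2_SU2 = [
    (8, 1, 1), (1, 3, 1), (1, 1, 3), (1, 1, 1),
    (3, 2, 2), (3, 2, 2), (3, 1, 1), (3, 1, 1),
]

if __name__ == "__main__":
    print("dim SO(10) adjoint:", dim_so_adjoint(10))
    print("dim SO(10) chiral spinor:", dim_so_chiral_spinor(10))
    print("45 under SU(5):", total_dimension(ADJOINT_UNDER_SU5))
    print("16 under SU(5):", total_dimension(SPINOR_UNDER_SU5))
    print("45 under SU(3) x SU(2) x SU(2):", total_dimension(ADJOINT_UNDER_SU3_SU2_SU2))
    print("16 under SU(3) x SU(2) x SU(2):", total_dimension(SPINOR_UNDER_SU3_SU2_SU2))
```

The same functions confirm the counting behind SO(6) and SU(4) (15 generators each) and behind SO(4) and SU(2) x SU(2) (6 generators each).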


FIGURE 12.1 
The multiplicities of the spin multiplets. 




FIGURE 12.2 
The four infinite families of the Dynkin classification. 


The Lie group techniques we need have all been done in the standard model 
section or are trivial extensions. But it is important to record the Dynkin 
classification. There are four infinite families denoted in Figure 12.2 with the
shorter vectors indicated by filled circles. Moving to the exceptional cases in 
increasing size we have of these, Eg has the curious link to octonians which is 
the longest number base in which reality, commutation, and association have 
all been given up. Again, $E_7$ and $E_8$ are real, which effectively rules out light
particles. There are a number of coincidences. First, $A_1$, $B_1$, and $C_1$ are all SU(2).
Second, $B_2 = C_2$. Third, $D_3 = A_3$. Fourth, if we remove one circle from $D_3$
to get $D_2$ it falls apart into two disconnected circles (the middle one must be
removed to stay in the $D_n$ family). Thus, $D_2$ is not simple. This is the important
statement that the algebra of SO(4) is the same as the algebra of SU(2) x SU(2).
This is the complete list of such coincidences. 
Problems


12.1 What two main problems did Salam point out with GUT models?


12.2 Derive $5 = (3, 1)_{-1/3} + (1, 2)_{1/2}$ of Georgi and Glashow for
SU(5) $\supset$ SU(3) x SU(2) x U(1).


12.3 Find, or look up, the two ways that SU(5) x U(1) can be embedded
in SO(10).


12.4 Show that the 16 of SO(10) has the decomposition $10 + \bar{5} + 1$ under
SU(5).


12.5 Show that the adjoint 45 representation of SO(10) has the
decomposition $45 = 24_0 + 1_0 + 10_1 + \overline{10}_{-1}$ under the SU(5) x U(1).


12.6 How many components have the spinor representations of SO(10)?


12.7 Show how SO(10) $\supset SU(3)_{color}$ x U(1) x $SU(2)_L$ x $SU(2)_R$.


12.8 Confirm the decomposition of the 16 in Equation (12.7).


12.9 Confirm the decomposition of the 45 in Equation (12.8).


12.10 Explain why the exceptional groups $E_7$ and $E_8$ are not thought to
be useful in GUT schemes.
The Simple Sphere


Now what is supersymmetry? At the most basic level it is a symmetry that 
relates bosons (integer spin particles) to fermions (odd half-integer spins) in a
way we shall examine in more detail in later chapters. There is no real evidence 
for the existence of this symmetry. However, it does seem to have almost 
magical powers in controlling infinities and improving the convergence of the 
SU(3), SU(2), and U(1) running coupling constants at what is then regarded 
as a supersymmetric unification scale. 

We have already seen that Kähler manifolds have the hyperkähler nature
fixed by an integer and now we can reveal that this same integer is the one
we met in the Borel theorem and is determined by the centralizer of the denominator in
the numerator when the coset space has the $G/H$ structure. We see that the
Goldstone bosons are a pair of pseudoscalar mesons with equal but opposite
charges $\pm e$ and they are massless in this approximation. The Kähler nature of
the manifold ensures that each of the Goldstone bosons (pions in elementary 
particle physics) has a spin one-half partner of the same charge and also 
massless. You can see how very restricted the particle spectrum is in this 
model. Of course, the couplings are also specified so the scheme overall is 
very specified indeed. 

Of course, it is just a model and the real world is probably totally different. 
(We do not know. Nobody has found supersymmetry experimentally yet.) 
Nevertheless there is a huge amount of literature in particle physics on this 
type of work. Much of it comes from foreign research groups with the contri- 
bution from Japan being both extensive and very good in mathematical terms. 
So if supersymmetry is ever discovered, there are many ingenious models to 
be tested. 

To help us understand this section we shall think in more physical terms,
where the $SU_2 / U_1$ structure is embedded directly into the chiral $\sigma$-model as
presented by P. Chang and F. Gürsey [1] and S. Weinberg [2, 3], although we shall
retain our own familiar notation for the most part. This structure was much
used in the 1960s with three pions (pseudoscalars) and one $\sigma$ particle (scalar
at the basic level), where the charged pions form an isotopic spin doublet and
the $\sigma$ is a singlet of isospin. This scheme did much to mirror current algebra
results and in its nonlinear form gave a basis for a much used perturbative
scheme. The transformation laws in this scheme are
\pi_A \to \pi_A + \epsilon_{AB} \theta_3 \pi_B + \lambda_A \sigma (13.1)
\sigma \to \sigma - \lambda_A \pi_A (13.2)


and we recognize a normal linear representation where the $\sigma$ is a singlet.
(Now in chiral SU(2) x SU(2) the corresponding multiplet is four dimensional;
there are three pseudoscalar fields.) In this case the nonlinearity results from
imposing the chiral SU(2) invariant constraint

\sigma^2 + \pi_A \pi_A = f_\pi^2 (13.3)

where $f_\pi$ is a constant to be determined from experiment, to eliminate the
unphysical $\sigma$ field. The transformation law is then

\pi_A \to \pi_A + \epsilon_{AB} \theta_3 \pi_B + \lambda_A [f_\pi^2 - \pi^2]^{1/2} (13.4)

where $\pi^2 = \pi_A \pi_A$ and we have arbitrarily selected the positive square root.
We emphasize that this is an example of the transformation laws derived 
earlier for a particular choice of our arbitrary invariant 0. The simple form of 
this transformation law, together with the intuitive feeling for the nonlinearity 
arising from the constraint, have made this a popular choice in the literature. 
However, we now turn to the stereographic choice of coordinates used by B. 
Zumino [4] to allow the introduction of supersymmetry by emphasizing the 
Kähler properties of the 2-sphere. Things now look very different. The two
real coordinates on the sphere (pseudoscalar fields) are replaced by a single 
complex variable z: 


(13.5) 


where V is a potential function so that the usual cross-derivative constraints 
are satisfied, for this to be a Kähler manifold. This is often referred to as the
existence of an almost complex structure. In this framework, the nonlinear 
transformation law for the coordinates takes the form 


z \to z + i \theta_3 z + \frac{c \omega}{2} + \frac{\bar{\omega} z^2}{2c} (13.6)
where 


c is a constant (identifiable as $2 f_\pi$) and we note that this transformation law
is holomorphic in z. It is simple to change to a more familiar pair of real
variables, now $x_A$ (A = 1, 2), by setting

z = x_1 + i x_2, (13.8)

and we find that Equation (13.7) yields


x_A \to x_A + \epsilon_{AB} \theta_3 x_B + \theta_B \left[ \delta_{AB} \frac{(c^2 - x^2)}{2c} + \frac{x_A x_B}{c} \right] (13.9)

where

x^2 = x_A x_A (13.10)

and Equation (13.7) has been used. It can be shown that [5]

\pi \cot(\theta) = [f_\pi^2 - \pi^2]^{1/2}, (13.11)
2 c x \cot(\theta) = c^2 - x^2. (13.12)

Direct comparison of these last two results gives

\pi^2 = \frac{4 c^2 f_\pi^2 x^2}{[c^2 + x^2]^2} (13.13)

or equivalently

\pi_A = \frac{2 c f_\pi x_A}{c^2 + x^2} (13.14)

as the connection between the two coordinate systems. Note again that c may
be identified with $2 f_\pi$ when the equality of the two coordinate systems is
transparent in the small field limit. I hope that this simple example makes clear 
the advantage of working with the general coordinate treatment whenever 
possible. 
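The link between the two coordinate systems can be tested numerically. The sketch below takes the nonlinear part of the stereographic transformation to be $\delta x_A = \theta_B [\delta_{AB} (c^2 - x^2)/(2c) + x_A x_B / c]$, maps to pion coordinates with $\pi_A = 2 c f_\pi x_A / (c^2 + x^2)$, and checks that the induced change is $\delta \pi_A = \theta_A [f_\pi^2 - \pi^2]^{1/2}$ to first order, which is the nonlinear $\sigma$-model law with the shift parameter identified with $\theta_A$. That identification, the numerical values, and the function names are assumptions made for illustration.

```python
import numpy as np


def pion_from_stereographic(x: np.ndarray, c: float, f_pi: float) -> np.ndarray:
    """pi_A = 2 c f_pi x_A / (c^2 + x^2), the map of Equation (13.14)."""
    return 2.0 * c * f_pi * x / (c**2 + x @ x)


def sigma_from_pion(pion: np.ndarray, f_pi: float) -> float:
    """Positive root of the constraint sigma^2 + pi^2 = f_pi^2."""
    return float(np.sqrt(f_pi**2 - pion @ pion))


def stereographic_shift(x: np.ndarray, theta: np.ndarray, c: float) -> np.ndarray:
    """Nonlinear part of the change in x_A for small parameters theta_B."""
    return theta * (c**2 - x @ x) / (2.0 * c) + x * (theta @ x) / c


def shift_matches_sigma_model(x, theta, c: float, f_pi: float) -> bool:
    """Compare the induced change in pi_A with theta_A sigma, to first order in theta."""
    x = np.asarray(x, dtype=float)
    theta = np.asarray(theta, dtype=float)
    before = pion_from_stereographic(x, c, f_pi)
    after = pion_from_stereographic(x + stereographic_shift(x, theta, c), c, f_pi)
    expected = theta * sigma_from_pion(before, f_pi)
    return bool(np.allclose(after - before, expected, rtol=0.0, atol=1e-11))


if __name__ == "__main__":
    f_pi = 1.0
    c = 2.0 * f_pi                       # the identification c = 2 f_pi
    x = np.array([0.3, -0.4])
    theta = np.array([2e-7, -1e-7])      # small, so first order suffices
    print(shift_matches_sigma_model(x, theta, c, f_pi))
    # In the small field limit the two coordinate systems agree when c = 2 f_pi.
    small = np.array([1e-4, 2e-4])
    print(np.allclose(pion_from_stereographic(small, c, f_pi), small, atol=1e-10))
```

The positive square root used for $\sigma$ is valid for $x^2 < c^2$, which is why the sample point is chosen well inside that region.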

We start this long section by reviewing K.J. Barnes, P.H. Dondi, and S. 
Sarkar [6] and the structure of chiral SU(2) x SU(2) to establish notation. The 
transformation of the fundamental (quark) multiplet is specified by 


e ME DEPI 
q 14 ti 514 i&i 50 (ivs)q (13.15) 


to the lowest order in the real parameters 7; and &i,i = 1, 2, 3, where c! are 
the familiar Pauli matrices. Note the extra $i \gamma_5$ factor in the final term, which
is included to ensure that the Goldstone bosons of this scheme will be
pseudoscalar. We emphasize that this is precisely the usual symmetry of the quark
and gluon QCD Lagrangian in the two flavor case (ignoring U(1)
complications), which leads to the familiar low-energy approximation to hadronic
physics [7-10]. Our $\gamma_5$ is not Hermitian, but self-barred, so that under the
transformations in Equation (13.15) the quark mass term $m \bar{q} q$ is not invariant and
so should not appear in the unbroken Lagrangian, whereas the kinetic term
proportional to $\bar{q} \gamma^\mu \partial_\mu q$ is invariant because the $\gamma^\mu$ anticommutes with the $\gamma_5$
in the axial generators. The crucial step in describing the Goldstone bosons is
to parametrize the coset space defined by the quotient of the SU(2) x SU(2)
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by the vector SU(2) parametrized by the £! alone. This takes the simple form 


\hat{L} = \exp\{ -\tfrac{1}{2} i \theta n^i \sigma^i (i\gamma_5) \}    (13.16)


where the Goldstone fields are described by


M^i = M n^i    (13.17)
with
n^i n^i = 1    (13.18)
so that
M^i M^i = M^2    (13.19)


and \theta is an arbitrary dimensionless function of the quotient of M by a constant
f_\pi. Provided that \theta is proportional to this quotient in the limit of small fields,
then f_\pi is proportional to the pion decay constant. This arbitrariness may be
viewed as the freedom to change coordinate systems on the coset space or
to redefine the field variables describing the mesons. Notice that the Goldstone
fields M^i really do serve to describe three pseudoscalar pions as usual.
This notation is reserved for this general coordinate system (as opposed to
\pi^i for the nonlinear \sigma-model coordinates, say), and we stress again that if \theta
is an arbitrary function of M (normalized to M for small fields) then all coordinate
systems (with overlapping coordinate patches, i.e., not prohibited by
singularities) are incorporated in this one description. If we define projection
operators by


P_L = \tfrac{1}{2}(1 + i\gamma_5)    (13.20)
P_R = \tfrac{1}{2}(1 - i\gamma_5)    (13.21)
P_L P_L = P_L    (13.22)
P_R P_R = P_R    (13.23)
P_L P_R = 0 = P_R P_L    (13.24)
P_L + P_R = 1    (13.25)


then we can write Equation (13.16) as
\hat{L} = L P_L + L^\dagger P_R    (13.26)


where L is unitary and the \gamma_5 dependence is now contained solely in the
projection operators. It is then clear that we can deal with


L = \exp\{ -\tfrac{1}{2} i \theta n^i \sigma^i \}    (13.27)
and reinstate the \gamma_5 factors only when wishing to consider the explicit couplings
of the Goldstone bosons to matter fields. The action of a group element
g (of SU(2) x SU(2)) on the coset space can be specified, following T. Clark and
U.T. Veldhuis [11], by


gL = L'h    (13.28)
where
L'(M_i) = L(M'_i)    (13.29)


specifies the nonlinear transformations of the Goldstone boson fields,

h = \exp\{ -\tfrac{1}{2} i \lambda_i \sigma^i \}    (13.30)


and the \lambda_i depend on the fields and the group parameters. What we have
are nonlinear transformations among the M_i (which give a realization of the
group), which are linear under the action of the SU(2) subgroup, thus neatly
describing a situation where the full group is still realized, but in a manner well
suited to spontaneous breaking to the subgroup. The Goldstone bosons are
a linear representation of the SU(2) subgroup only. Although the procedure
extends to other representations, for our purposes it will be sufficient to stay
mostly in the fundamental representation.
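
The projection-operator identities (13.20) to (13.25) and the split of the coset element into L on one chirality and its inverse on the other, Equation (13.26), can be verified with small matrices. The following sketch is illustrative only. It assumes a 2 x 2 diagonal matrix G as a stand-in for i\gamma_5 (all that is used is G G = 1), picks an arbitrary sample angle and unit vector, and uses a truncated Taylor series for the matrix exponential; none of these choices come from the text.

```python
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def lincomb(sa, a, sb, b):
    """Return sa*a + sb*b for equal-shaped matrices."""
    return [[sa * a[i][j] + sb * b[i][j] for j in range(len(a[0]))]
            for i in range(len(a))]

def kron(a, b):
    """Kronecker product a (x) b."""
    return [[a[i][j] * b[k][l] for j in range(len(a[0])) for l in range(len(b[0]))]
            for i in range(len(a)) for k in range(len(b))]

def identity(size):
    return [[1 + 0j if i == j else 0j for j in range(size)] for i in range(size)]

def dagger(a):
    return [[a[j][i].conjugate() for j in range(len(a))] for i in range(len(a[0]))]

def expm(a, terms=40):
    """Matrix exponential by truncated Taylor series (adequate for small norms)."""
    result, term = identity(len(a)), identity(len(a))
    for order in range(1, terms):
        term = [[v / order for v in row] for row in matmul(term, a)]
        result = lincomb(1, result, 1, term)
    return result

def max_abs_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(len(a)) for j in range(len(a[0])))

SIGMA = [
    [[0j, 1 + 0j], [1 + 0j, 0j]],
    [[0j, -1j], [1j, 0j]],
    [[1 + 0j, 0j], [0j, -1 + 0j]],
]
G = [[1 + 0j, 0j], [0j, -1 + 0j]]   # stand-in for i*gamma_5 (assumption), G*G = 1
ONE = identity(2)
P_L = lincomb(0.5, ONE, 0.5, G)     # Eq. (13.20)
P_R = lincomb(0.5, ONE, -0.5, G)    # Eq. (13.21)

# Eqs. (13.22)-(13.25): idempotent, mutually orthogonal, complete.
proj_err = max(
    max_abs_diff(matmul(P_L, P_L), P_L),
    max_abs_diff(matmul(P_R, P_R), P_R),
    max_abs_diff(matmul(P_L, P_R), [[0j, 0j], [0j, 0j]]),
    max_abs_diff(lincomb(1, P_L, 1, P_R), ONE),
)

theta = 0.7                # sample value of the arbitrary function theta
n_vec = (0.6, 0.0, 0.8)    # sample unit vector n^i
n_sigma = [[sum(n_vec[a] * SIGMA[a][i][j] for a in range(3)) for j in range(2)]
           for i in range(2)]

L = expm([[-0.5j * theta * v for v in row] for row in n_sigma])             # Eq. (13.27)
L_hat = expm([[-0.5j * theta * v for v in row] for row in kron(n_sigma, G)])  # Eq. (13.16)

# Eq. (13.26): L_hat = L P_L + L^dagger P_R (isospin factor first, spinor factor second).
decomp_err = max_abs_diff(L_hat, lincomb(1, kron(L, P_L), 1, kron(dagger(L), P_R)))
print(proj_err, decomp_err)
```

Both printed numbers should be at the level of floating-point rounding, which is why the \gamma_5 dependence can be dropped and reinstated later.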

We are now ready to discuss the chiral SU(2) structure embedded in this
framework. Consider the subgroup of the chiral SU(2) x SU(2) group specified
in Equation (13.15) by retaining only the parameters \theta_3 and \phi_A with A = 1
and 2. Obviously this is an SU(2) subgroup and we call it chiral SU(2) in
recognition of the (i\gamma_5) factors with the \sigma^A generators. Clearly the \sigma^3 generates
a U(1) subgroup so that the coset space obtained by the quotient of chiral
SU(2) by this U(1) is parametrized by coordinates M_A, A = 1 and 2, which
can be viewed as describing two Goldstone pseudoscalars. Notice that the
embedding of this SU(2)/U(1) structure in the SU(2) x SU(2) structure is uniquely
specified. Moreover, if we set M_3 and n_3 to zero in our previous discussion,
then


L = \exp\{ -\tfrac{1}{2} i \theta n_A \sigma^A \}    (13.31)
and set \lambda_A = 0 so
h = \exp\{ -\tfrac{1}{2} i \lambda_3 \sigma^3 \}    (13.32)
where \theta is now an arbitrary function of


M^2 = M_1^2 + M_2^2    (13.33)


which when M_3 becomes zero remains as the only independent scalar. Although
many readers will instantly appreciate the nature of this embedding,
experience has taught us that confusion often arises at this point and it is
hoped that a more detailed discussion will not divert readers too far from
the real theme. Suppose in this quark model, we define the vector and axial
currents, as usual, by


V^i_\mu = \bar{q}\gamma_\mu \tfrac{1}{2}\sigma^i q   and   A^i_\mu = \bar{q}\gamma_\mu \tfrac{1}{2}\sigma^i (i\gamma_5) q    (13.34)


and implement the transformations in Equation (13.15) by charges


Q^i_V = \int V^i_0 \, d^3x   and   Q^i_A = \int A^i_0 \, d^3x    (13.35)


by using free field commutation relations. Naturally, while the symmetry is
unbroken, the charges are time independent as a result of being constructed
from the time components of the conserved Noether currents. The chiral
SU(2) x SU(2) algebra can now be written as


[Q^i_V, Q^j_V] = i\epsilon_{ijk} Q^k_V,    (13.36)
which is the algebra of the central (vector) subgroup, together with
[Q^i_V, Q^j_A] = i\epsilon_{ijk} Q^k_A    (13.37)


confirming that the axial charges are in a three-dimensional representation of
the vector subalgebra and


[Q^i_A, Q^j_A] = i\epsilon_{ijk} Q^k_V    (13.38)


showing the closure of the axial parts of the algebra into the vector subalgebra 
and revealing the symmetric space structure which clarifies the coset space 
construction we introduced earlier. Now where is the chiral SU(2) algebra 
embedded? As there are only two inequivalent types of SU(2) in SU(2) x 
SU(2), this one must be equivalent to one of the more obvious ones. The left 
and right SU(2) algebras defined by the generators 


Q^i_L = \tfrac{1}{2}(Q^i_V + Q^i_A)    (13.39)
Q^i_R = \tfrac{1}{2}(Q^i_V - Q^i_A)    (13.40)

have the property that
[Q^i_L, Q^j_R] = 0    (13.41)


so that the centralizer of either of these SU(2) algebras in the SU(2) x SU(2) 
is the other. This is quite unlike the way in which the vector subgroup is 
embedded as seen in Equations (13.37) and (13.38) so that the chiral SU(2) 
must be equivalent to the vector subgroup. We can take the unitary operator
that implements this equivalence to be
U = \exp\{ \tfrac{1}{2} i \pi P_L \sigma^3 \},    (13.42)


which introduces the mapping 


Vi Ay 
yela (13.43) 
V3 A3 


which is a trivial relabelling of the form we took with \theta_3 and \phi_A as parameters,
and obviously has the correct commutation properties. This equivalence confirms
that the coset space identified as the quotient of chiral SU(2) by the U(1)
generated by V_3 is indeed the two-dimensional surface of a sphere as we have
claimed. The mapping in Equation (13.43) clearly mixes parity types, so for the
physical applications we have in mind the basis of \sigma^3 with \sigma^A(i\gamma_5) as generators
is the appropriate one, justifying as it does our notation M_A as two fields
describing pseudoscalar mesons, and dictating the form of the couplings to
matter fields correspondingly, exactly as in the full SU(2) x SU(2) scheme.
Note particularly that the chiral SU(2) never appears as the denominator of
the quotient defining a coset space. It is a subgroup of SU(2) x SU(2), which
is equivalent to the vector SU(2) but at no stage is considered as a conserved
subgroup in a broken symmetry scenario. Thus the M_i and subsequently
the M_A are always interpreted as fields describing pseudoscalar Goldstone
bosons. The point is that chiral SU(2) is a subgroup of chiral SU(2) x SU(2)
and when the latter is spontaneously broken to the vector SU(2) (with pseudoscalars
M_i) then the chiral SU(2) is broken to the U(1) generated by V_3
(with M_A as the pseudoscalars). The broken chiral SU(2) scheme (embedded
uniquely in all possible broken chiral SU(2) x SU(2) supersymmetrizations)
is unambiguously defined in the framework provided by the very simple
Kähler structure of the 2-sphere [12].
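
The claim that the generators A_1, A_2, and V_3 close into an SU(2) algebra, and that this chiral SU(2) is unitarily equivalent to the vector SU(2), can be checked with explicit matrices. The sketch below is an illustration under stated assumptions: G is a 2 x 2 stand-in for i\gamma_5 with G G = 1, the generators are taken as V_i = \sigma_i/2 (x) 1 and A_i = \sigma_i/2 (x) G, and U is built from P_L using exp(i\pi\sigma^3/2) = i\sigma^3, which is one reading of Equation (13.42). With that choice U sends V_A to -A_A for A = 1, 2 and leaves V_3 fixed; using P_R instead flips the sign.

```python
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def lincomb(sa, a, sb, b):
    """Return sa*a + sb*b for equal-shaped matrices."""
    return [[sa * a[i][j] + sb * b[i][j] for j in range(len(a[0]))]
            for i in range(len(a))]

def scaled(s, a):
    return [[s * v for v in row] for row in a]

def kron(a, b):
    return [[a[i][j] * b[k][l] for j in range(len(a[0])) for l in range(len(b[0]))]
            for i in range(len(a)) for k in range(len(b))]

def dagger(a):
    return [[a[j][i].conjugate() for j in range(len(a))] for i in range(len(a[0]))]

def max_abs(a):
    return max(abs(v) for row in a for v in row)

def commutator(a, b):
    return lincomb(1, matmul(a, b), -1, matmul(b, a))

def eps(i, j, k):
    """Levi-Civita symbol for indices 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) / 2

SIGMA = [
    [[0j, 1 + 0j], [1 + 0j, 0j]],
    [[0j, -1j], [1j, 0j]],
    [[1 + 0j, 0j], [0j, -1 + 0j]],
]
ONE = [[1 + 0j, 0j], [0j, 1 + 0j]]
G = [[1 + 0j, 0j], [0j, -1 + 0j]]        # stand-in for i*gamma_5 (assumption)

V = [kron(scaled(0.5, s), ONE) for s in SIGMA]   # vector generators
A = [kron(scaled(0.5, s), G) for s in SIGMA]     # axial generators
T = [A[0], A[1], V[2]]                           # chiral SU(2): A_1, A_2, V_3

# Closure: [T_i, T_j] = i eps_ijk T_k.
closure_err = 0.0
for i in range(3):
    for j in range(3):
        expected = [[0j] * 4 for _ in range(4)]
        for k in range(3):
            expected = lincomb(1, expected, 1j * eps(i, j, k), T[k])
        diff = lincomb(1, commutator(T[i], T[j]), -1, expected)
        closure_err = max(closure_err, max_abs(diff))

# Equivalence with the vector SU(2) via U = i sigma^3 (x) P_L + 1 (x) P_R.
P_L = lincomb(0.5, ONE, 0.5, G)
P_R = lincomb(0.5, ONE, -0.5, G)
U = lincomb(1, kron(scaled(1j, SIGMA[2]), P_L), 1, kron(ONE, P_R))

def conjugate_by_U(x):
    return matmul(matmul(U, x), dagger(U))

equiv_err = max(
    max_abs(lincomb(1, conjugate_by_U(V[0]), 1, A[0])),    # V_1 -> -A_1
    max_abs(lincomb(1, conjugate_by_U(V[1]), 1, A[1])),    # V_2 -> -A_2
    max_abs(lincomb(1, conjugate_by_U(V[2]), -1, V[2])),   # V_3 -> V_3
)
print(closure_err, equiv_err)
```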

We can now see the advantages of using this chiral 2-sphere as a model. It 
is simpler than the chiral SU(2) x SU(2) scheme, even in the purely bosonic 
sector. Moreover, the 2-sphere is a Kähler manifold and so admits a supersymmetric
extension in which the Goldstone bosons acquire fermionic (Weyl)
partners without yet more quasi-Goldstone bosons and fermions being forced 
into the model. Also the resulting couplings among particles are uniquely 
specified. Contrast this with the situations in References 14 and 15 where the 
number of bosons doubles as does the number of associated fermions, and 
finally the couplings involving these new particles are not uniquely specified. 
Of course, these latter cases are closer to the physics of the real world (they 
have three pions, for example) but the embedded chiral 2-sphere model retains 
many significant features and is a far more tractable theoretical laboratory. 

In the following we give a treatment of most of the basic mathematics at the 
root of the subject of this book. Curiously, this begins by consideration of the 
embedding of the structure needed for the current problem into that of a larger
system that has previously been solved in general coordinates leading to a
closed form involving only simple functions [6]. The embedding is unique. 
Thus the starting point is a review of this established larger system and its 
solution, in which the liberty of changing notation (slightly) for convenience 
has been taken. 

We recall that the SU (2) x SU(2) structure of (13.15), which we can conve- 
niently rewrite as 


S AE Ye 
anal Sa i 2 (iys)q (13.44) 
contains our S_2 coset space in the structure


[( 2589 'Gy3][C2 22:0? Gy5)] 

[F180] 
as we noted previously. The SU(2) x SU(2) structure is spanned by the two sets 
of three orthogonal elements L; and R; satisfying the commutation relations 


(13.45) 


[L_i, L_j] = i\epsilon_{ijk} L_k    (13.46)
[R_i, R_j] = i\epsilon_{ijk} R_k    (13.47)
[L_i, R_j] = 0    (13.48)
and the linear combinations
V_i = L_i + R_i    (13.49)
A_i = L_i - R_i    (13.50)


are frequently used. The V; generate an SU(2) subgroup, which is parity 
conserving. An element of the SU (2) x SU(2) group may be specified by two 
sets of three real parameters and the alternative expressions 


g = \exp(-i[\theta_i V_i + \phi_i A_i])    (13.51)
g = \exp(-i\theta^L_i L_i) \exp(-i\theta^R_i R_i)    (13.52)
will prove useful with
\theta^L_i = \theta_i + \phi_i    (13.53)
\theta^R_i = \theta_i - \phi_i    (13.54)


specifying the correspondence. 
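
The commutation relations (13.46) to (13.48), the resulting vector and axial algebra, and the parameter correspondence (13.53) and (13.54) can all be confirmed with a concrete matrix realization. The sketch below is illustrative: it assumes L_i = \sigma_i/2 (x) P_L and R_i = \sigma_i/2 (x) P_R with 2 x 2 diagonal projectors, and the parameter values are arbitrary samples rather than anything from the text.

```python
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def lincomb(sa, a, sb, b):
    """Return sa*a + sb*b for equal-shaped matrices."""
    return [[sa * a[i][j] + sb * b[i][j] for j in range(len(a[0]))]
            for i in range(len(a))]

def scaled(s, a):
    return [[s * v for v in row] for row in a]

def kron(a, b):
    return [[a[i][j] * b[k][l] for j in range(len(a[0])) for l in range(len(b[0]))]
            for i in range(len(a)) for k in range(len(b))]

def max_abs(a):
    return max(abs(v) for row in a for v in row)

def commutator(a, b):
    return lincomb(1, matmul(a, b), -1, matmul(b, a))

def eps(i, j, k):
    """Levi-Civita symbol for indices 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) / 2

def algebra_err(X, Y, Z):
    """Largest entry of [X_i, Y_j] - i eps_ijk Z_k over all i, j."""
    worst = 0.0
    for i in range(3):
        for j in range(3):
            expected = [[0j] * 4 for _ in range(4)]
            for k in range(3):
                expected = lincomb(1, expected, 1j * eps(i, j, k), Z[k])
            worst = max(worst, max_abs(lincomb(1, commutator(X[i], Y[j]), -1, expected)))
    return worst

SIGMA = [
    [[0j, 1 + 0j], [1 + 0j, 0j]],
    [[0j, -1j], [1j, 0j]],
    [[1 + 0j, 0j], [0j, -1 + 0j]],
]
P_L = [[1 + 0j, 0j], [0j, 0j]]     # assumed diagonal projectors
P_R = [[0j, 0j], [0j, 1 + 0j]]

L = [kron(scaled(0.5, s), P_L) for s in SIGMA]
R = [kron(scaled(0.5, s), P_R) for s in SIGMA]
V = [lincomb(1, L[i], 1, R[i]) for i in range(3)]     # Eq. (13.49)
A = [lincomb(1, L[i], -1, R[i]) for i in range(3)]    # Eq. (13.50)

errs = {
    "LL": algebra_err(L, L, L),                       # Eq. (13.46)
    "RR": algebra_err(R, R, R),                       # Eq. (13.47)
    "LR": max(max_abs(commutator(L[i], R[j])) for i in range(3) for j in range(3)),
    "VV": algebra_err(V, V, V),
    "VA": algebra_err(V, A, A),
    "AA": algebra_err(A, A, V),
}

# Parameter correspondence: theta.V + phi.A = theta^L.L + theta^R.R.
theta = (0.2, -0.5, 0.1)   # sample values
phi = (0.3, 0.4, -0.7)     # sample values
lhs = [[0j] * 4 for _ in range(4)]
rhs = [[0j] * 4 for _ in range(4)]
for i in range(3):
    lhs = lincomb(1, lhs, 1, lincomb(theta[i], V[i], phi[i], A[i]))
    rhs = lincomb(1, rhs, 1, lincomb(theta[i] + phi[i], L[i], theta[i] - phi[i], R[i]))
param_err = max_abs(lincomb(1, lhs, -1, rhs))
print(errs, param_err)
```

The "AA" entry checks that two axial generators commute into a vector generator, which is the symmetric space structure used in the coset construction.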
Every element of the group can be decomposed into a product of the form 


g = exp(—inj Ai) exp(—ini Vi), (13.55) 


which is unique in a neighborhood of the identity element and this plays a 
crucial role in the general nonlinear realization scheme. The linear transformation
laws are best specified by treating the quarks as a Dirac spinor in the usual
manner and taking


i 1—ys 


be Rd 
qq "M ; 1 "A o — d (13.56) 
as the concrete infinitesimal form. 
Since the matrices 
1 
pz ary (13.57) 
1— 
Pr= — E (13.58) 


act as a standard set of projection operators, the treatment of linear transform- 
ing multiplets of SU(2) x SU(2) now follows trivially. 

To treat the nonlinear realizations of SU(2) x SU(2) in full generality, the 
Mi; of the adjoint vector of SU(2) must be considered in more detail. In the 
terminology of L. Michel and L.A. Radicati [13], the vector is said to be generic 
(or belong to the generic structure) if all eigenvalues of M are distinct. For 
the generic case, the minimal polynomial for the matrix is the characteristic 
polynomial satisfying the equation 


3 
[ [(M-m220 (13.59) 
A=1 


where the m 4 are the eigenvalues, which satisfy 


3 
yo ma=0 (13.60) 
A=1 


if the matrix is traceless. Thus the two vectors with components given by 
powers of the matrix in the form 


Mř = ; Tr([M]*o;) [o = 1, 2] (13.61) 


are a linearly independent set and the quantities 
3 3 
$4 = Tr([M]^) = $ "[ms]^ = Y map (13.62) 
B=1 B=1 


are two independent SU(2) invariants (S(1) is identically zero). At once it is 
clear that the general vector, which can be constructed from the M;, has the 
form 


ti = Fa Mai (13.63) 


where the F, are the functions of the two independent SU(2) invariants. This 
freedom has been discussed at length by S. Gasiorowicz and D. Geffen [7].
From the point of view of field theory, it corresponds to freedom of choice of
interpolating fields. Provided that F;(0) is taken to be unity and parity is re- 
spected, then all yj so defined are equally good interpolating fields. From 
a geometric viewpoint, the y; may be regarded as coordinates of points 
of the three-dimensional coset space manifold formed by the quotient of 
SU(2) x SU(2) by the vector SU (2) subgroup. The freedom is then viewed 
as the ability to change the coordinates within a local patch near the origin. 

Next we establish the transformation laws of the Killing vectors. It is suffi- 
cient to work to the lowest order in the group parameters, and we denote the 
transformations by 


gi Mi > Mic&Ki +K (13.64) 


where KY and K4 are Killing field components constructed from the M; 
themselves. The general theory is well described by S. Coleman, J. Wess, and 
B. Zumino [14] and C.G. Callan, S. Coleman, J. Wess, and B. Zumino [15] 
and we follow the general line of their arguments in our own notation. Of 
course, the action under an element of the U(1) subgroup is linear so that K 7 
is already known, but we shall let this emerge from our calculations. It will 
prove convenient also to work with the combinations 


KŁ = Z[KT + Kf] (13.65) 


NIe NIe 


Kj = [Kj = Kf] (13.66) 


ij 

corresponding to actions of the left and right chiral subgroups, respectively. 
The particularly important property is that these field components viewed as 
matrices have inverses so that 


Kry; Krig = p = Kgp;Kniq (13.67) 
KiyiKrj; = bij = Krig K gj; (13.68) 


follows at once from the free transitive nature of a translation action. Notice 
also that K 4 has an inverse, but Ky is singular. 

All the information required about the Killing vectors is, of course, con- 
tained in the action 


g exp(&i Ai) = exp(&j Aj) exp(ni Vi) (13.69) 
where, if we parametrize an arbitrary point on the manifold in the form 
exp(£i, Ai) = p(M) (13.70) 
then we can write 


g \exp(\xi_i A_i) = p(M') \exp(\eta_i V_i)    (13.71)
where M'_i and \eta_i depend upon both M_i and g and where g is an arbitrary
element of the full group.

Next we apply the automorphism (induced by the parity) to Equation 
(13.71). If we denote by S(g) the transform of an arbitrary element g, then 
we obtain 


S(g)p- (M) = p( M) exp(ni V.) (13.72) 


since the axial generators change sign. Combining this with Equation (13.71) 
produces 


p(M)) = gp?(M)S(g~") (13.73) 


a result that has been emphasized by S. Coleman, J. Wess, and B. Zumino [14] 
and C.J. Isham [17]. This result has the advantage that n; has been eliminated 
so that it will immediately yield information on the Killing fields alone. In the 
quark representation we define 


D[p (M) = Uu (M)P. + u(M)P^] (13.74) 


where LI is unitary and unimodular and learn that 
, i i 
U?(M') = exp EI U?(M) exp EDU 
= UM) - 56 oU (M) ai zo U'(M)a, re (13.75) 


when Equation (13.73) is applied. Now we may combine this with the form 
of Equation (13.64) to learn 


-io;U? = 2U7 Kj (13.76) 
io; = 2U* KM (13.77) 


where we adopt the standard notation 


e e (13.78) 

for any Z and hence also find 
Kj)! =i Tr(U7,U 7o) (13.79) 
Kay ces quus) (13.80) 


since we know that these inverses exist. If we define the orthogonal transfor- 
mation on the adjoint transformation M; by 


M_i \to R_{ij} M_j = \tfrac{1}{2} \mathrm{Tr}[U^\dagger \sigma_i U \sigma_j] M_j    (13.81)
then eliminating the derivative terms, we obtain
a 1 E 
Ki, Kg; = le Tr[Uu ?o;U?o;] = (—) Rik Rej, (13.82) 


where the final step follows from the completeness relation. This is the main 
result of this stage of our calculations. The second step consists of expanding 
the whole of Equation (13.71) in the quark representation just as we treated 
that section of it contained in Equation (13.73). 

To present the results in a tractable form we introduce 


kpi =i Tr[U pU oi] (13.83) 
kkpi = (-)i T[UU pai], (13.84) 


which, as we shall see, prove to be the most important. Comparison with
Equations (13.79) and (13.80) shows that the k_{ij} are components of the Killing
fields for an alternative scheme in which U is replaced by its unitary unimodular
square root. At once, therefore, we have a counterpart to the basic result
of Equation (13.82), and we write


K,+R?=0 (13.85) 
kt + Rkg =0, (13.86) 


with an obvious matrix notation. The expansion of Equation (13.71) in the 
quark representation then yields the results 


2K;kg! =1+v—2R (13.87) 
2Krkz =1—v (13.88) 
2K;k;! =1+0 (13.89) 
2Krk;'=1—v—2R! (13.90) 


by similar computation to that which led to Equation (13.82) and the identity 
Qu uu =O (13.91) 
is the only extra element used. Note that 
v = Ki ky’ — Krk’ (13.92) 


emerges by algebra.
Simple substitution now yields at once


v = kV(k- — k’)! = kV(k^)7! (13.93) 
R=(1+v)(1-v)! (13.94) 
Ky = ky -kg = ky (13.95) 
2KA = ka -- kv(kA) kv, (13.96) 


which are the required relations. Equation (13.95) is a trivial result because 
we have linear transformations induced by the vector SU(2) subgroup, but 
(as remarked previously) it is satisfying to see it arise as part of the general 
analysis. The remainder of these results reveal that a knowledge of k4 in a 
tractable form represents a complete solution for the entire nonlinear scheme 
we are studying. 

With the decomposition given in Equation (13.55), the action of a general 
element g of the full group may be written as 


gexp(-i& A) = exp(—ié; A) exp(—in V.) 
= L(M) exp(—iniV)) (13.97) 


where M; and n; both depend on M; and g. Then the primary result of the 
general theory is that 


g: Mi > M, (13.98) 


gives a nonlinear realization of the algebra, which is linear on the SU(2) vector
subgroup. Moreover if h is an element of a vector subgroup and


h: Yo > D(h)or Vr (13.99) 
is a linear (unitary) representation of that subgroup, then 
g : Vo > Diexp(—ini Vi)ler Vr (13.100) 


gives a realization of the full group. Notice that this latter transformation is
linear in \psi but nonlinear (through \eta_i) in the M_i when g is not in the vector
subgroup. Fields that transform according to Equation (13.100) are called standard
fields and it is important to understand that by a suitable redefinition of coordinates
any nonlinear realization of SU(2) x SU(2), which is linear on the
vector subgroup, can be brought into this standard form. In practice the most
useful result is that, if one has a linear irreducible (unitary) representation of
SU(2) x SU(2) such that


8 : No > D[glor Nr (13.101) 
then Wo(M) = D[L * (M)lor Nr (13.102) 


transform as the components of standard fields.
It is now clear that there are three classes of fields to consider:


1. Linear representations, which may be built up in the usual way 
as multispinors with the transformation laws defined by Equation 
(13.56). These will not be treated in more detail. 


2. Vectors M_i transforming as the adjoint representation of SU(2) with a
nonlinear transformation law under chiral action specified by Equation
(13.97). These will allow a description of the massless Goldstone
bosons (pions, etc.) corresponding to the axial degrees of freedom
spontaneously violated. The specification of invariants constructed
(nonlinearly) from these is most important and will be exhibited later.


3. Standard fields, which appear linearly in their transformation laws,
but with nonlinear functions of the M_i induced according to Equations
(13.101) and (13.98). These are important in describing matter
(e.g., nucleons) interacting with the Goldstone bosons as chiral matter.
Once more, the specification of the corresponding invariants is
most important and will be given later.


The technical problem of finding the invariants is solved in Barnes, Dondi, 
and Sarkar [6]. A crucial step is the resolution of the powers of the matrix M 
in the form 


[M]^ = [mp] Pg = Mas Ps (13.103) 


where the P_B are two Hermitian matrices, each 2 x 2, with the properties


P_A P_B = \delta_{AB} P_B (no sum)    (13.104)
\mathrm{Tr}(P_A) = 1    (13.105)
and
\sum_{A=1}^{2} P_A = 1    (13.106)


where this 1 is the unit (2 x 2) matrix. Although the P4 are not in general 
diagonal, the projection operator 


1 
Pai = 5 T( Padi) (13.107) 
and 
1 
(Paun = Pai) + -ÓuN (13.108) 


where, because the P4 are complete, it follows that 


2 
bos (13.109) 
A=1
and introducing


Pa = V2[ Pai — (1+ /2)1] Pj (13.110) 
with 
1 
A2Pa = pai + — pi 13.111 
Ai = Pai z” ( ) 


establishes that p,; are orthonormal. 
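
For the 2 x 2 case the projector resolution of Equations (13.103) to (13.106) can be written down explicitly. The sketch below is a hedged illustration: it assumes the normalization M = M_i \sigma_i (the text may carry an extra numerical factor), for which the eigenvalues are +|M| and -|M| and the projectors are (1 +/- n.\sigma)/2. The sample vector is arbitrary.

```python
import math

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def lincomb(sa, a, sb, b):
    """Return sa*a + sb*b for 2 x 2 matrices."""
    return [[sa * a[i][j] + sb * b[i][j] for j in range(2)] for i in range(2)]

def max_abs(a):
    return max(abs(v) for row in a for v in row)

SIGMA = [
    [[0j, 1 + 0j], [1 + 0j, 0j]],
    [[0j, -1j], [1j, 0j]],
    [[1 + 0j, 0j], [0j, -1 + 0j]],
]
ONE = [[1 + 0j, 0j], [0j, 1 + 0j]]
ZERO = [[0j, 0j], [0j, 0j]]

M_vec = (0.3, -1.2, 0.4)                       # sample field values
M = math.sqrt(sum(v * v for v in M_vec))       # invariant length
n_sigma = [[sum(M_vec[a] / M * SIGMA[a][i][j] for a in range(3)) for j in range(2)]
           for i in range(2)]
M_mat = [[M * v for v in row] for row in n_sigma]   # assumed normalization M_i sigma_i

P = [lincomb(0.5, ONE, 0.5, n_sigma), lincomb(0.5, ONE, -0.5, n_sigma)]
m = [M, -M]                                    # eigenvalues belonging to P[0], P[1]

# Eq. (13.104): P_A P_B = delta_AB P_B.
product_err = max(
    max_abs(lincomb(1, matmul(P[a], P[b]), -1, P[b] if a == b else ZERO))
    for a in range(2) for b in range(2)
)
# Eq. (13.105): unit trace.  Eq. (13.106): completeness.
trace_err = max(abs(P[a][0][0] + P[a][1][1] - 1) for a in range(2))
complete_err = max_abs(lincomb(1, lincomb(1, P[0], 1, P[1]), -1, ONE))

# Eq. (13.103): powers of the matrix resolve as sums of eigenvalue powers times P_A.
power_err = 0.0
power = ONE
for alpha in range(1, 4):
    power = matmul(power, M_mat)
    resolved = lincomb(m[0] ** alpha, P[0], m[1] ** alpha, P[1])
    power_err = max(power_err, max_abs(lincomb(1, power, -1, resolved)))
print(product_err, trace_err, complete_err, power_err)
```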

The second-rank tensors defined by the M; are conveniently handled by an 
extension of these ideas and fall into two classes. One such class is formed by 
the two independent tensors defined by 


1 
(Pas) = Pain; = 5 Tr Pahi Peh j) (A z B) (13.112) 
and 
1 
Lij = 2 Tr(P4A; Ppdj), (13.113) 


which have the properties 


Ib (13.114) 
IPAg =O= Papl (13.115) 
PapPcp = 94cóppPAn (no sum) (13.116) 


in terms of the matrix notation of the last section. Moreover, these are all
Hermitian matrices and the trace of each P_{AB} is unity. Since it is easy to show
also that


> Pap =1-1 (13.117) 
AZB 


where the sum is over all A and B but excluding terms with A = B, this gives a
projection operator resolution in one sector of the space of these second-rank
tensors and so I will decompose further. The second class of tensors may be
identified with the independent matrices with components 


(Pap)ij = Pai PBj» (13.118) 


which span the subspace of 3 x 3 matrices projected out on multiplication 
by I from both sides and which are therefore orthogonal to the subspace in 
which the Pap lie. Since the py; are orthonormal, the multiplication law for 
the pag is 


Pai Ppj = Spy Pas- (13.119) 


It has been established by Barnes and Delbourgo [16] that all the independent
second-rank tensors which can be constructed from the M_i are spanned by
the three independent p_{AB} and P_{AB}.
The most general unitary unimodular matrix U constructed from the M_i
may be written in the form


U = U4P4 = exp EDI where Y76,-0 (13.120) 
A=1 


but the 04 are otherwise completely arbitrarily independent functions of the 
independent SU(2) invariant S4 subject to the considerations of parity and 
weak field limits as mentioned before. This effective arbitrary function of the 
invariant is characteristic of the general solution and will persist throughout 
this work. 

It has been conventional to define 


264 — m4— ze (13.121) 
with 
ma = N2($4 + ) (13.122) 
so that, extending the notation used previously 
Mi = ópi (13.123) 
i = pi (13.124) 


follow immediately. Similarly, defining 


V2, = 04-8 (13.125) 
04 — 2i 4-0 (13.126) 


the y/4 may be treated as an independent (arbitrary) function of the ø, which 
then serves as the independent variant. 

The transformation laws for all realizations are now given by Barnes, Dondi, and
Sarkar [6] in closed form and in terms of simple functions. Restricting attention
to first-order derivatives of the field with respect to space and time, and
also restricting attention to a study of the Goldstone boson fields M_i and
the standard fields, the results can be given in terms of the general analysis
of Barnes and Delbourgo [16] and Isham [17]. There are two important results.
First, although \partial_\mu M_i and \partial_\mu \psi_\tau do not transform as standard fields, the
covariant derivatives


D, Mi = yi (13.127) 
D, Wr = 0, Vr = ivyi( Ti) rave (13.128) 
where under SU(2)
and where
L^ (M)à,L(M) = exp(—& Ai), exp(-&Ai) = v, V; +a, A; — (13.130) 


have precisely this property. Second, they show that the most general Lagrangian
of the type under consideration may be written as a function of
the standard fields \psi, D_\mu \psi, and D_\mu M_i only; that is, the M_i will not appear
explicitly and the Goldstone bosons will be massless. It then follows that the
Lagrangian so formed will be invariant under SU(2) x SU(2) if and only if
it is constructed to be invariant under the SU(2) vector subgroup. This latter
requirement is achieved by index saturation.

The result given in Barnes, Dondi, and Sarkar [6] (now dropping the chiral 
projectors and normalizing for this problem) takes the concrete form 


[VA — Vs] 
v2 


ð EN 
D, Mi "age cce (Pag + Pra] «MO 


(13.131) 


and represents a complete specification of the required Lagrangian in simple 
closed form. This can be rewritten in the form 
do sin 
D, Mi = P ag zg Qu = mn) ; 

which is perhaps a simpler form, already seen by the reader. 

Noting the appearance of the projection operators, and realizing that the 
invariant Lagrangian density L is found by index saturation, it is straightfor- 
ward to see that it is proportional to a constant 


(13.132) 


de V? in? 0 
k= (=) nang + “Gr (ban — nang) (13.133) 
where we have now taken the indices to the A range, which would have been 
confusing in the previous equation. The condition for this to be Hermitian is
obviously


\left( \frac{d\theta}{dM} \right)^2 = \frac{\sin^2\theta}{M^2}    (13.134)
with the solution
M = c \tan\frac{\theta}{2}    (13.135)


being the one conventionally chosen, where c is a constant. When c = 1, the 
invariant Lagrangian density is 


c= EOE) (13.136) 


2[r? 4- ££]
where \zeta = r(M_1 + iM_2) is used to emphasize the constant radius of the
2-sphere.
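
That M = c tan(\theta/2) solves the condition (13.134) can be confirmed in a few lines. The sketch below is only a numerical illustration: it inverts the solution to \theta(M) = 2 arctan(M/c), estimates d\theta/dM by a central finite difference (the step size is an arbitrary choice made here), and compares the two sides of Equation (13.134) at several sample points.

```python
import math

def theta_of_M(M, c):
    """Inverse of Eq. (13.135): M = c tan(theta / 2)."""
    return 2.0 * math.atan(M / c)

def lhs(M, c, h=1e-6):
    """(d theta / d M)^2, with the derivative estimated by a central difference."""
    d = (theta_of_M(M + h, c) - theta_of_M(M - h, c)) / (2.0 * h)
    return d * d

def rhs(M, c):
    """sin^2(theta) / M^2, the right-hand side of Eq. (13.134)."""
    return (math.sin(theta_of_M(M, c)) / M) ** 2

samples = [(0.1, 1.0), (0.7, 1.0), (2.5, 1.0), (0.4, 2.0), (3.0, 2.0)]  # (M, c) pairs
worst = max(abs(lhs(M, c) - rhs(M, c)) for M, c in samples)
print(worst)
```

Analytically both sides equal (2c / (c^2 + M^2))^2, so the printed discrepancy only reflects the finite-difference error.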

Using the geometric formulation of Isham [17] gives the coset space metric 
in the form related to the covariant derivatives as 


Sij (ðu Mj) (9^ Mj) = (D, Mi) (DMi) (13.137) 


and we have normalized g_{ij} to \delta_{ij} in the limit of zero fields. In matrix notation
this yields


Be! IPad Pa 
a= 4 [po vs] 
va £| 
P P a 13.138 
zb» Gan AEEA AB + pasin? | J | ( ) 


immediately because of the orthonormality. 

This is, perhaps, an ideal moment to tell first-year research students about 
what may actually be possible. Soon after completing his first-year courses 
and examinations, Chris Isham came to ask for suggestions for a research 
project. Professor Paul Matthews and I were not quite prepared for this! So 
Paul explained to him what was possibly the problem of the decade and which
neither of us had any idea of how to solve, and off Chris went to try his hand.
A few days later Chris caught me alone and asked two technical questions,
which I was able to answer. Soon afterward he came into the office where Paul
and I were working, put down a pile of paper and said, "Is this what you 
are looking for?" A glance showed us, although we did not understand the 
new nonlinear mathematics that he used, that he had solved the problem. 
Paul said, "Wonderful, now go away and put in the fermions." When Chris 
left, Paul wrote a title page, with the author “C.J. Isham” and handed it to 
the secretary to type for publication. Not long afterward Chris came with 
the next paper. Chris and I started writing a series of related papers. One 
day he showed me a letter and asked if I understood it. It was from one of 
the great East Coast universities in the United States, offering him a tenured 
professorship at a huge salary, and the right to appoint four new members of 
academic staff without consulting anyone. I advised him to show the letter to 
Paul and Abdus Salam. Chris wrote his PhD thesis and passed the interview, 
still before the end of his first year of research. After a year working at the 
Trieste Institute, he returned to Imperial College as a lecturer. He is still there 
as a professor of theoretical physics, although he is now much better known 
for his research on general relativity. We became very good friends and are 
still in contact. 

But enough of this; let us turn our attention to field redefinitions. We have 
alluded to the generality and utility of our parametrization. Now it is time 
to see the scheme in action. Consider first the coordinates resulting from a 
constraint on the singlet in the chiral SU(2) triplet representation. In chiral
SU(2), one introduces a multiplet
Q — c 40Ays-c o (13.139) 


transforming as a qq bispinor, where 


0 ; 
4 — 4509 - i c iy)q (13.140) 
as we have seen previously. It is trivial to see that the group action on this 
three-dimensional multiplet is 


\pi_A \to \pi_A + \epsilon_{A3B}\theta_3\pi_B + \phi_A\sigma    (13.141)
\sigma \to \sigma - \phi_A\pi_A    (13.142)


and we recognize a normal linear representation in which the scalar field is a
U(1) singlet. In this scheme the nonlinearity results from imposing the chiral
SU(2) invariant constraint


\sigma^2 + \pi_A\pi_A = f_\pi^2    (13.143)


where f_\pi^2 is constant, to eliminate the \sigma field. The transformation law is then


\pi_A \to \pi_A + \epsilon_{A3B}\theta_3\pi_B + \phi_B [f_\pi^2 - \pi^2]^{1/2} \delta_{AB}    (13.144)


where \pi^2 = \pi_A\pi_A, and we have arbitrarily selected the positive square root.
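
A short numerical experiment shows why the constraint (13.143) is consistent with the linear transformations (13.141) and (13.142), and how eliminating \sigma produces the nonlinear law (13.144). The sketch below is illustrative only: the function names and the sample field values and parameters are assumptions made here. The constraint is preserved up to terms of second order in the small parameters.

```python
import math

def linear_step(pi, sigma, theta3, phi):
    """Infinitesimal linear transformation, Eqs. (13.141)-(13.142)."""
    p1, p2 = pi
    new_pi = [p1 - theta3 * p2 + phi[0] * sigma,     # eps_{132} = -1
              p2 + theta3 * p1 + phi[1] * sigma]     # eps_{231} = +1
    new_sigma = sigma - (phi[0] * p1 + phi[1] * p2)
    return new_pi, new_sigma

def nonlinear_step(pi, f_pi, theta3, phi):
    """Same transformation with sigma eliminated via the constraint, Eq. (13.144)."""
    p1, p2 = pi
    root = math.sqrt(f_pi * f_pi - p1 * p1 - p2 * p2)   # positive square root
    return [p1 - theta3 * p2 + phi[0] * root,
            p2 + theta3 * p1 + phi[1] * root]

f_pi = 1.0                                  # sample value
pi = [0.3, 0.2]                             # sample fields
sigma = math.sqrt(f_pi ** 2 - sum(p * p for p in pi))
theta3, phi = 3e-5, [1e-5, -2e-5]           # small sample parameters

new_pi, new_sigma = linear_step(pi, sigma, theta3, phi)

# The constraint sigma^2 + pi_A pi_A = f_pi^2 should survive to first order.
constraint_drift = abs(new_sigma ** 2 + sum(p * p for p in new_pi) - f_pi ** 2)

# The nonlinear law should reproduce the transformed pi_A exactly.
nl_pi = nonlinear_step(pi, f_pi, theta3, phi)
law_diff = max(abs(a - b) for a, b in zip(new_pi, nl_pi))
print(constraint_drift, law_diff)
```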

We emphasize that this is an example of the transformation laws derived
earlier for a particular choice of our arbitrary invariant function \theta(\pi). The
simple form of this transformation law, together with the intuitive feeling
for the nonlinearity arising from the constraint, has made this a popular
choice in the literature. However, we now return to the stereographic choice of
coordinates used by Zumino [4] to allow the introduction of supersymmetry
by emphasizing the Kähler properties of the 2-sphere. Things now look very
different. The two real coordinates on the sphere (pseudoscalar fields) are
replaced by a single complex variable z. Now the metric is a Hermitian form
and the nonzero components are written as

g_{z\bar{z}} = \frac{\partial^2 V}{\partial z \, \partial \bar{z}}    (13.145)


where V is a potential function so that the usual cross-derivative constraints [6]
are satisfied for this to be a Kähler manifold. In this framework, the nonlinear
transformation law for the coordinates takes the form
z \to z + i\theta_3 z + \frac{c\,\omega}{2} + \frac{\bar{\omega}\, z^2}{2c}    (13.146)
where \omega = \phi_1 + i\phi_2,    (13.147)


c is a constant (identifiable as 2f_\pi) and we note that this transformation law
is holomorphic in z. Of course, it is simple to change to a more familiar pair
of real variables, now x_A (A = 1, 2), by setting
z = x_1 + i x_2    (13.148)


and we find that Equation (13.146) yields


x_A \to x_A + \epsilon_{A3B}\theta_3 x_B + \phi_B \left[ \frac{\delta_{AB}(c^2 - x^2)}{2c} + \frac{x_A x_B}{c} \right]    (13.149)
where x^2 = x_A x_A    (13.150)


and Equation (13.147) has been used. By comparing the \delta_{AB} terms in Equations
(13.64) and (13.144) we discover


\pi \cot(\theta) = [f_\pi^2 - \pi^2]^{1/2}    (13.151)
and similarly comparing Equations (13.64) and (13.149)
2cx \cot(\theta) = c^2 - x^2    (13.152)
emerges. Direct comparison of these last two results gives


\pi^2 = \frac{4c^2 f_\pi^2 x^2}{[c^2 + x^2]^2}    (13.153)
or equivalently
\pi_A = \frac{2c f_\pi x_A}{c^2 + x^2}    (13.154)


as the connection between the two coordinate systems. We note again that c
may be identified with 2f_\pi, when the equality of the two coordinate systems is
transparent in the small field limit. This simple example, I hope, makes clear
the advantage of working with the general coordinate treatment.
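
Two statements in this passage lend themselves to a direct check: that the holomorphic law (13.146) and the real law (13.149) are the same transformation, and that the map (13.154) carries the stereographic law into the sigma-model law (13.144). The sketch below is a hedged illustration with assumed function names and arbitrary sample values. The second check is a first-order statement, so it holds only up to terms quadratic in the small parameters.

```python
import math

def real_delta(x, c, theta3, phi):
    """Change in x_A under Eq. (13.149)."""
    x1, x2 = x
    r2 = x1 * x1 + x2 * x2
    dot = phi[0] * x1 + phi[1] * x2
    d1 = -theta3 * x2 + phi[0] * (c * c - r2) / (2 * c) + x1 * dot / c
    d2 = theta3 * x1 + phi[1] * (c * c - r2) / (2 * c) + x2 * dot / c
    return [d1, d2]

def complex_delta(z, c, theta3, omega):
    """Change in z under Eq. (13.146), with omega = phi_1 + i phi_2."""
    return 1j * theta3 * z + c * omega / 2 + omega.conjugate() * z * z / (2 * c)

def to_sigma(x, c, f_pi):
    """Coordinate map of Eq. (13.154)."""
    r2 = x[0] * x[0] + x[1] * x[1]
    return [2 * c * f_pi * xa / (c * c + r2) for xa in x]

def sigma_delta(pi, f_pi, theta3, phi):
    """Change in pi_A under Eq. (13.144)."""
    root = math.sqrt(f_pi * f_pi - pi[0] ** 2 - pi[1] ** 2)
    return [-theta3 * pi[1] + phi[0] * root, theta3 * pi[0] + phi[1] * root]

f_pi = 1.0
c = 2.0 * f_pi
x = [0.5, -0.3]                             # sample point with |x| < c
theta3, phi = 2e-6, [1e-6, -3e-6]           # small sample parameters

# Check 1: complex and real forms agree.
dx = real_delta(x, c, theta3, phi)
dz = complex_delta(complex(x[0], x[1]), c, theta3, complex(phi[0], phi[1]))
form_diff = abs(complex(dx[0], dx[1]) - dz)

# Check 2: transforming x and then mapping matches mapping and then transforming pi.
pi = to_sigma(x, c, f_pi)
pi_after = to_sigma([x[0] + dx[0], x[1] + dx[1]], c, f_pi)
d_pi = sigma_delta(pi, f_pi, theta3, phi)
map_diff = max(abs(pi_after[a] - pi[a] - d_pi[a]) for a in range(2))
print(form_diff, map_diff)
```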

If you have no background or interest in particle physics, move on to an- 
other chapter. I am not sure whether this section should be included at all, 
but it is short, so here we go. 

Think about the chiral 2-sphere as previously described. In the particle
physics interpretation there are two pseudoscalar Goldstone bosons with
equal and opposite unit charges ±e. They must be massless. Of course, in
the real world, experiment tells us that they have masses that are not zero, 
but are the lightest of all masses in the hadronic sector. Now, recall the Borel 
theorem of Chapter 10, which tells us that because the centralizer of the U(1) 
in SU(2) is just itself we have a single supersymmetry. In practice, this integer 
controls the number of supersymmetries that can be imposed. This becomes 
exceptionally important in advanced topics, such as the Maldacena conjec- 
ture. It allows what might be thought of as improvements to this already very 
important topic. 
Just in case even a single reader has been inspired to work in theoretical high
energy particle physics, I am adding a few extra remarks. To other readers 
my apologies, but you can read this section too if you feel so inclined. 

The first thing, of course, is to find a good supervisor (interested in these 
topics) at a good university physics or mathematics department and then to 
get yourself a support grant from the appropriate research council. It is much 
easier to survive on these grants than it was just a few years ago. 

Then despite having years of training to do, invest in a copy of the book 
Harmonic Superspace by A.S. Galperin, E.A. Ivanov, V.I. Ogievetsky, and E.S. 
Sokatchev [18], who worked together at the Joint Institute for Nuclear
Research in Dubna, Russia, where harmonic superspace was born. The first
draft of the book was written some years ago when all four authors were 
working together at the Bogoliubov Laboratory of Theoretical Physics. I find 
in every generation there is some outstanding worker whose research papers 
must be followed at almost any cost. Ogievetsky was one of my favorites, and 
his death was a huge loss. I am confident that his colleagues would not mind 
my making special mention of him. 

In this book you can learn how our simple sphere can be used to extend
the index of Kähler manifolds and therefore the number of supersymmetries.
You can also learn how to improve upon the famous Maldacena
AdS/CFT conjecture [19]. In fact, the whole book is a real treasure-house in the
subject.
Problems


13.1 Read through Equations (13.1) to (13.4). Now close your notes and
find the transformation law for $z_A$.


13.2 Check that you can recover the nonlinear transformation law for z 
in Equation (13.6). 


13.3 Read through Equations (13.8) to (13.14). Now close your notes and
find the expression for $z_A$ connecting the coordinate systems.


13.4 Without using your notes, explain why in the embedding of SU(2)
into SU(2) x SU(2) it is crucial to express the parametrization of the
quotient (coset) space in the form


L- exp —sieno'i)} 


and say what each of the quantities in this expression is and what 
role it plays. 


13.5 Show how the projection operators $P_L = \frac{1}{2}(1 + i\gamma_5)$ and
$P_R = \frac{1}{2}(1 - i\gamma_5)$ work.


13.6 Read through your notes from Equations (13.1) to (13.12). Now close 
your notes and write how the form of g = exp(—i£; Aj) exp(—i£; V.) 
is reached. 


13.7 Read through your notes from Equations (13.21) to (13.25). Now 
close your notes and work through this for yourself. 


13.8 Read through your notes from Equations (13.26) to (13.39). Now 
close your notes and try to do this for yourself. This is a hard section. 
You may refer to your notes whenever you need, but make sure that 
you can do all the steps eventually. 


13.9 Read through your notes for Equations (13.40) to (13.53). Now close 
your notes and reproduce this material for yourself. 


13.10 Read through your notes from Equations (13.1) to (13.5). Now close 
your notes and do this for yourself. 


13.11 Read through your notes from Equation (13.6) until you have seen 
the three classes of fields we need to consider. Now close your notes 
and do this for yourself. 


13.12 The next section from Equations (13.8) to (13.24) is generally 
thought to be difficult. Please read it until you feel sure that you 
understand it. 


13.13 Read your notes from Equation (13.25) to Equation (13.31) and make
sure that you understand it.


13.14 Read your notes from Equation (13.32) to Equation (13.35) and the
next paragraph. Make sure you understand it and why it is being
done.


13.15 Using the results of Problem (13.14) and inventing your own
normalization, derive the expression for $D_\mu M_i$ in Equation (13.128).


13.16 Read through your notes from Equation (13.37) to Equation (13.41).
Make sure you understand why this last equation represents the
surface of the two-dimensional sphere.


13.17 Read the section of your notes from Equation (13.1) until you know
what happened to Professor Chris Isham. Think about this.


13.18 Read your notes from Equation (13.44) to Equation (13.49) until you
are sure you can do this for yourself.


13.19 Make sure that you understand the stereographic coordinates used
by Zumino.


13.20 Check Equation (13.147) for yourself. Check the law is holomorphic
in z.


13.21 Work out the expression for $x_A$ in Equation (13.42) for yourself.
Beyond the Standard Model


The world we see is Poincaré invariant. It also has internal symmetries—both 
global and local—the latter giving rise to interactions (forces) described by 
gauge theories. Can there be more? Can we mix Poincaré and internal? Only 
by supersymmetry, which mixes bosons with fermions. 

We need to establish notation. The basic objects we need will be spinor
representations of the SO(1, 3) Lorentz group. We take the metric $g^{\mu\nu}$ defined
by

$$ g^{00} = 1, \qquad g^{ij} = -\delta^{ij}, \qquad g^{0i} = 0. \tag{14.1} $$


The primary tool we need is the Clifford algebra. We introduce the
anticommutator

$$ \{\gamma^\mu, \gamma^\nu\} = 2g^{\mu\nu} \tag{14.2} $$

where as we know the Dirac matrices have a unique representation (up to
equivalence) as 4 x 4 matrices. By alternately forming commutators and
anticommutators we find all 16 independent matrices. In this notation $\gamma^0$ is
Hermitian and the $\gamma^i$ are antihermitian.

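One representation, with useful low energy properties, is given in terms of
the Pauli matrices

$$ \sigma^1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma^2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \sigma^3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \tag{14.3} $$

which have products

$$ \sigma^i \sigma^j = \delta^{ij} + i\epsilon^{ijk}\sigma^k \tag{14.4} $$

in terms of the Kronecker delta and Levi-Civita tensor. We can take

$$ \gamma^0 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \qquad \gamma^i = \begin{pmatrix} 0 & \sigma^i \\ -\sigma^i & 0 \end{pmatrix} \tag{14.5} $$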
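which have the required properties. Building the rest of the Dirac matrices as
suggested gives sequentially

$$ \sigma^{\mu\nu} = \frac{i}{2}[\gamma^\mu, \gamma^\nu] \tag{14.6} $$

which in the above representation take the form

$$ \sigma^{0i} = i\begin{pmatrix} 0 & \sigma^i \\ \sigma^i & 0 \end{pmatrix}, \qquad \sigma^{ij} = \epsilon^{ijk}\begin{pmatrix} \sigma^k & 0 \\ 0 & \sigma^k \end{pmatrix}. \tag{14.7} $$

Then, realizing that the maximum number of Dirac matrices in a product is
four, we introduce

$$ \gamma^5 = \gamma^0\gamma^1\gamma^2\gamma^3 = \frac{1}{4!}\epsilon_{\mu\nu\rho\sigma}\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma. \tag{14.8} $$

Here the notation is $\epsilon_{0123} = 1$.
In this representation we find

$$ \gamma^5 = \begin{pmatrix} 0 & -i \\ -i & 0 \end{pmatrix} \tag{14.9} $$

which is antihermitian, squares to $-1$, and has

$$ \{\gamma^\mu, \gamma^5\} = 0 \tag{14.10} $$

as a property, which clearly "extends" the Clifford algebra.
We then define

$$ \sigma^{\mu 5} = \frac{i}{2}[\gamma^\mu, \gamma^5] = i\gamma^\mu\gamma^5, \tag{14.11} $$

to complete our set of 16 basic 4 x 4 matrices. In our representation these take
the forms

$$ \sigma^{05} = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}, \qquad \sigma^{i5} = \begin{pmatrix} \sigma^i & 0 \\ 0 & -\sigma^i \end{pmatrix} \tag{14.12} $$

where we can easily check that the notation is designed so that we have

$$ \gamma^0 \Gamma_R^\dagger \gamma^0 = \Gamma_R \tag{14.13} $$

for $R = 1, \ldots, 16$. See Chapter 4 for the $\gamma$ matrix product table.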
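The algebra above is easy to check numerically. The following is a minimal sketch, assuming the metric signature (+, -, -, -) and the block form $\gamma^0 = \mathrm{diag}(1, -1)$, $\gamma^i = \begin{pmatrix} 0 & \sigma^i \\ -\sigma^i & 0 \end{pmatrix}$; the helper names (`clifford_holds`, `bar_conjugate`, `basis_16`) are illustrative only.

```python
import numpy as np

# Pauli matrices and 2x2 building blocks.
s1 = np.array([[0, 1], [1, 0]], dtype=complex)
s2 = np.array([[0, -1j], [1j, 0]], dtype=complex)
s3 = np.array([[1, 0], [0, -1]], dtype=complex)
I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
I4 = np.eye(4, dtype=complex)

# Assumed metric signature (+, -, -, -).
g = np.diag([1.0, -1.0, -1.0, -1.0])

# Assumed representation: gamma^0 = diag(1, -1), gamma^i = [[0, s_i], [-s_i, 0]].
gamma = [np.block([[I2, Z2], [Z2, -I2]])]
gamma += [np.block([[Z2, s], [-s, Z2]]) for s in (s1, s2, s3)]


def anticommutator(a, b):
    return a @ b + b @ a


def clifford_holds(gammas):
    """True if {gamma^mu, gamma^nu} = 2 g^{mu nu} for all index pairs."""
    return all(
        np.allclose(anticommutator(gammas[m], gammas[n]), 2 * g[m, n] * I4)
        for m in range(4) for n in range(4)
    )


# gamma^5 as the ordered product gamma^0 gamma^1 gamma^2 gamma^3.
gamma5 = gamma[0] @ gamma[1] @ gamma[2] @ gamma[3]


def bar_conjugate(m):
    """The map M -> gamma^0 M^dagger gamma^0."""
    return gamma[0] @ m.conj().T @ gamma[0]


def basis_16(gammas, g5):
    """1, gamma^mu, sigma^{mu nu} (mu < nu), gamma^5, sigma^{mu 5}."""
    sig = [0.5j * (gammas[m] @ gammas[n] - gammas[n] @ gammas[m])
           for m in range(4) for n in range(m + 1, 4)]
    sig5 = [1j * gm @ g5 for gm in gammas]
    return [I4] + list(gammas) + sig + [g5] + sig5


basis = basis_16(gamma, gamma5)
rank = np.linalg.matrix_rank(np.array([b.flatten() for b in basis]))
print("Clifford algebra holds:", clifford_holds(gamma))
print("number of matrices:", len(basis), "rank:", rank)
```

The rank test confirms that the 16 matrices are linearly independent, so they span all 4 x 4 matrices.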
At this stage we had better think about the Poincaré group to put our 
notation into a physical context. The transformations consist of translations 
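in space-time, together with spatial rotations and Lorentz boosts to different
velocities. We can write this in the form

$$ x'^\mu = \Lambda^\mu{}_\nu x^\nu + a^\mu \tag{14.14} $$

where the $a^\mu$ and $\Lambda^\mu{}_\nu$ are constants, and the constraint equation

$$ g_{\mu\nu}\Lambda^\mu{}_\rho\Lambda^\nu{}_\sigma = g_{\rho\sigma} \tag{14.15} $$

ensures invariance of our metric.
As Lie assures us, we can work without loss of generality in infinitesimal
form

$$ \Lambda^\mu{}_\nu = \delta^\mu{}_\nu + \omega^\mu{}_\nu, \qquad a^\mu = \epsilon^\mu \tag{14.16} $$

with constant small parameters $\epsilon^\mu$ and $\omega^{\mu\nu}$, and the constraint equation
simply tells us that

$$ \omega^{\mu\nu} = -\omega^{\nu\mu} \tag{14.17} $$

so we have four parameters $\epsilon^\mu$ for the translations and six parameters $\omega^{\mu\nu}$,
where $\omega^{ij}$ describe rotations and $\omega^{0i}$ describe boosts. We can view a small
group element as

$$ g = 1 + i\epsilon_\mu P^\mu - \frac{i}{2}\omega_{\mu\nu}M^{\mu\nu} \tag{14.18} $$

and expanding our transformation as

$$ \begin{aligned} x'^\mu &= x^\mu + \omega^\mu{}_\nu x^\nu + \epsilon^\mu \\ &= x^\mu + \omega_{\alpha\beta}\, g^{\alpha\mu} x^\beta + \epsilon_\alpha g^{\alpha\mu} \\ &= x^\mu + \tfrac{1}{2}\omega_{\alpha\beta}\,[g^{\alpha\mu}x^\beta - g^{\beta\mu}x^\alpha] + \epsilon_\alpha g^{\alpha\mu} \end{aligned} \tag{14.19} $$

we discover that the generators satisfy the algebra of the Poincaré group by
having the action on fields (not coordinates!) specified by

$$ P^\mu = i\partial^\mu \tag{14.20} $$

$$ \text{and} \quad M^{\alpha\beta} = i(x^\alpha\partial^\beta - x^\beta\partial^\alpha). \tag{14.21} $$

It is then easy to confirm the algebra as

$$ \begin{aligned} [P^\mu, P^\nu] &= 0 \\ [M^{\mu\nu}, P^\rho] &= i(g^{\nu\rho}P^\mu - g^{\mu\rho}P^\nu) \\ [M^{\mu\nu}, M^{\rho\sigma}] &= i(M^{\mu\sigma}g^{\nu\rho} - M^{\mu\rho}g^{\nu\sigma} + M^{\nu\rho}g^{\mu\sigma} - M^{\nu\sigma}g^{\mu\rho}). \end{aligned} \tag{14.22} $$

We note that the translations commute, and form a vector representation of
the homogeneous Lorentz group whose generators satisfy the algebra in the
final line.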
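The homogeneous part of this algebra can be confirmed with explicit 4 x 4 matrices. This is a sketch only: it assumes the metric (+, -, -, -) and uses the standard vector-representation generators $(M^{\mu\nu})^\alpha{}_\beta = i(g^{\mu\alpha}\delta^\nu_\beta - g^{\nu\alpha}\delta^\mu_\beta)$, which are a convenient choice made here and are not taken from the text.

```python
import numpy as np
from itertools import product

g = np.diag([1.0, -1.0, -1.0, -1.0])  # assumed metric g^{mu nu}


def M(mu, nu):
    """Vector-representation generator (M^{mu nu})^alpha_beta (assumed form)."""
    m = np.zeros((4, 4), dtype=complex)
    for a, b in product(range(4), repeat=2):
        m[a, b] = 1j * (g[mu, a] * (nu == b) - g[nu, a] * (mu == b))
    return m


def commutator(a, b):
    return a @ b - b @ a


def lorentz_algebra_holds():
    """Check the [M, M] commutator for every choice of the four indices."""
    for mu, nu, rho, sig in product(range(4), repeat=4):
        lhs = commutator(M(mu, nu), M(rho, sig))
        rhs = 1j * (M(mu, sig) * g[nu, rho] - M(mu, rho) * g[nu, sig]
                    + M(nu, rho) * g[mu, sig] - M(nu, sig) * g[mu, rho])
        if not np.allclose(lhs, rhs):
            return False
    return True


def infinitesimal_lorentz(w):
    """1 - (i/2) w_{mu nu} M^{mu nu} for an antisymmetric parameter array w."""
    lam = np.eye(4, dtype=complex)
    for mu, nu in product(range(4), repeat=2):
        lam = lam - 0.5j * w[mu, nu] * M(mu, nu)
    return lam


rng = np.random.default_rng(0)
raw = 1e-3 * rng.normal(size=(4, 4))
w = raw - raw.T                      # antisymmetric w_{mu nu}
lam = infinitesimal_lorentz(w)
print("algebra holds:", lorentz_algebra_holds())
```

The same small element reproduces $\delta^\mu{}_\nu + \omega^\mu{}_\nu$ and preserves the metric to first order in the parameters.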
Recall how the states are found in quantum mechanics. We look for a
complete commuting set of observables. An obvious first choice is the $P^\mu$, since
they commute with each other. So we set

$$ P^\mu |p^\mu\rangle = p^\mu |p^\mu\rangle \tag{14.23} $$

and recognize that $P^2 = P^\mu P_\mu$ is an invariant. Now we can distinguish the
massive and massless cases.


Massive Case 


For the $m \neq 0$ case we work in the rest frame where $E = p^0 = m$ and $\vec{p} = 0$.
We then ask for members of the algebra commuting with the commuting
observables we already have and discover the little group, also known as the
stability group, generated by the $M^{ij}$. These are rotations of course, and by
writing

$$ M^{ij} = \epsilon^{ijk} J^k \tag{14.24} $$

$$ \text{or} \quad J^i = \tfrac{1}{2}\epsilon^{ijk} M^{jk} \tag{14.25} $$

we reveal the algebra

$$ [J^i, J^j] = i\epsilon^{ijk} J^k \tag{14.26} $$

of SU(2) or SO(3) or angular momentum. We now add $J_3$ and $J^2 = J^i J^i$ to
the complete commuting set of observables in the form

$$ \begin{aligned} J^2 |j, m\rangle &= j(j + 1)\,|j, m\rangle \\ J_3 |j, m\rangle &= m\,|j, m\rangle \end{aligned} \tag{14.27} $$

where $j$ is an integer or half-integer, and $m$ takes all values between $-j$ and $j$
in integer steps. We now have the massive particles we know and love.
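The spin-$j$ representations just described can be built explicitly. The sketch below uses the standard ladder-operator construction (an assumption of this example, not a construction given in the text) and checks the angular momentum algebra and the value of $J^2$.

```python
import numpy as np


def spin_matrices(two_j):
    """Return (Jx, Jy, Jz) for spin = two_j / 2 in the basis m = spin, ..., -spin."""
    spin = two_j / 2.0
    dim = two_j + 1
    m = spin - np.arange(dim)                  # eigenvalues of J_3
    jz = np.diag(m).astype(complex)
    jp = np.zeros((dim, dim), dtype=complex)   # raising operator J_+
    for col in range(1, dim):
        # <m + 1| J_+ |m> = sqrt(spin (spin + 1) - m (m + 1))
        jp[col - 1, col] = np.sqrt(spin * (spin + 1) - m[col] * (m[col] + 1))
    jm = jp.conj().T                           # lowering operator J_-
    jx = 0.5 * (jp + jm)
    jy = -0.5j * (jp - jm)
    return jx, jy, jz


def check_spin(two_j):
    """Check [Jx, Jy] = i Jz (and cyclic) and J^2 = spin (spin + 1)."""
    jx, jy, jz = spin_matrices(two_j)
    spin = two_j / 2.0
    algebra = (np.allclose(jx @ jy - jy @ jx, 1j * jz)
               and np.allclose(jy @ jz - jz @ jy, 1j * jx)
               and np.allclose(jz @ jx - jx @ jz, 1j * jy))
    casimir = np.allclose(jx @ jx + jy @ jy + jz @ jz,
                          spin * (spin + 1) * np.eye(two_j + 1))
    return algebra and casimir


for two_j in range(5):
    print("spin", two_j / 2, "ok:", check_spin(two_j))
```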


Massless Case 


In the $m = 0$ case we go to the frame where $p^\mu = (E; 0, 0, E)$ and discover
that the little group is now best described by introducing

$$ W^\mu = \frac{1}{2}\epsilon^{\mu\nu\rho\sigma} P_\nu M_{\rho\sigma} \tag{14.28} $$

where we note that $P_\mu W^\mu = 0$, so that $W^\mu$ has only three independent
components. The algebra of the little group is now (in this frame)

$$ \begin{aligned} [W^1, W^2] &= 0 \\ [(W^3/E), W^2] &= iW^1 \\ [(W^3/E), W^1] &= -iW^2, \end{aligned} \tag{14.29} $$

which we recognize as the algebra of the two-dimensional Euclidean group E(2),
where $W^1$ and $W^2$ are translations and $W^3/E$ is the rotation in the plane. This
is not a compact group algebra (nor is the Lorentz group, of course) and it can
only have finite dimensional representations if they are nonunitary. In practice
we have only found particles in which $W^1$ and $W^2$ have zero eigenvalues. (They
could in principle be continuous.) We then define the helicity, a pseudoscalar, by

$$ \lambda = \frac{n_\mu W^\mu}{n_\mu P^\mu} \tag{14.30} $$

where $n_\mu$ is an arbitrary vector with $n_\mu P^\mu \neq 0$. We usually take $n_\mu$ along the
time direction when

$$ \lambda = \frac{\vec{J}\cdot\vec{p}}{|\vec{p}|} \tag{14.31} $$

and we speak of the component of the angular momentum, or spin, along the
three-momentum. We now have the massless photon with two polarization
states, and so on.


Projection Operators 


Now let's return to our Dirac matrices, and consider the transformations of
Dirac four-component spinors (either wave functions or fields; just reverse
the sign of the parameter if you have a vector operator) under the Lorentz
group. We write

$$ \psi'_\alpha(\Lambda x) = S_\alpha{}^\beta \psi_\beta(x) = \psi_\alpha(x) - \frac{i}{2}\omega_{\mu\nu}(\Sigma^{\mu\nu})_\alpha{}^\beta\psi_\beta(x) + \cdots \tag{14.32} $$

and need the $\Sigma^{\mu\nu}$ to be a realization of the $M^{\mu\nu}$. Looking at the Dirac matrix
multiplication table we realize that

$$ \Sigma^{\mu\nu} = \frac{1}{2}\sigma^{\mu\nu} \tag{14.33} $$

for the spinors. Now we notice also that

$$ [\gamma^5, \sigma^{\mu\nu}] = 0 \tag{14.34} $$

so we define projection operators

$$ P_L = \frac{1}{2}(1 + i\gamma^5) \quad \text{and} \quad P_R = \frac{1}{2}(1 - i\gamma^5) \tag{14.35} $$

with the following properties

$$ \begin{aligned} P_L P_L &= P_L \\ P_R P_R &= P_R \\ P_L P_R &= P_R P_L = 0 \\ P_L + P_R &= 1. \end{aligned} \tag{14.36} $$
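These projector properties can be verified directly. The sketch below assumes the representation $\gamma^0 = \mathrm{diag}(1, -1)$, $\gamma^i = \begin{pmatrix} 0 & \sigma^i \\ -\sigma^i & 0 \end{pmatrix}$ with $\gamma^5 = \gamma^0\gamma^1\gamma^2\gamma^3$ and $P_{L,R} = \frac{1}{2}(1 \pm i\gamma^5)$; the function names are illustrative.

```python
import numpy as np

pauli = [np.array([[0, 1], [1, 0]], dtype=complex),
         np.array([[0, -1j], [1j, 0]], dtype=complex),
         np.array([[1, 0], [0, -1]], dtype=complex)]
I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
I4 = np.eye(4, dtype=complex)

# Assumed Dirac matrices and gamma^5 (antihermitian, squares to -1).
gamma = [np.block([[I2, Z2], [Z2, -I2]])]
gamma += [np.block([[Z2, s], [-s, Z2]]) for s in pauli]
gamma5 = gamma[0] @ gamma[1] @ gamma[2] @ gamma[3]

P_L = 0.5 * (I4 + 1j * gamma5)
P_R = 0.5 * (I4 - 1j * gamma5)


def sigma(mu, nu):
    """sigma^{mu nu} = (i/2) [gamma^mu, gamma^nu]."""
    return 0.5j * (gamma[mu] @ gamma[nu] - gamma[nu] @ gamma[mu])


def projector_properties_hold():
    """Idempotent, mutually orthogonal, and summing to the identity."""
    return (np.allclose(P_L @ P_L, P_L) and np.allclose(P_R @ P_R, P_R)
            and np.allclose(P_L @ P_R, 0) and np.allclose(P_R @ P_L, 0)
            and np.allclose(P_L + P_R, I4))


def projectors_commute_with_sigma():
    return all(np.allclose(P_L @ sigma(m, n), sigma(m, n) @ P_L)
               for m in range(4) for n in range(4))


print("projector properties:", projector_properties_hold())
print("commute with sigma^{mu nu}:", projectors_commute_with_sigma())
```

Each projector has rank two, which is why the projected spinors carry only two independent components.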


Weyl Spinors and Representation 


Obviously these commute with the $\sigma^{\mu\nu}$, so they can be used to decompose
the Dirac spinor into left and right Weyl spinors,

$$ \psi_L = P_L\psi \tag{14.37} $$

$$ \psi_R = P_R\psi, \tag{14.38} $$

which are actually two-component spinors. We can easily see that $\psi_L$ and $\psi_R$
transform using, respectively,


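$$ \sigma_L^{\mu\nu} = P_L \sigma^{\mu\nu} P_L = \frac{1}{2}\left(\sigma^{\mu\nu} + i\gamma^5\sigma^{\mu\nu}\right) \tag{14.39} $$

$$ \text{and} \quad \sigma_R^{\mu\nu} = P_R \sigma^{\mu\nu} P_R = \frac{1}{2}\left(\sigma^{\mu\nu} - i\gamma^5\sigma^{\mu\nu}\right). \tag{14.40} $$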


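There is an alternative representation of the Dirac matrices, usually called
the Weyl representation or the chiral representation, which makes this easier to
visualize. Here we have

$$ \gamma^0 = \begin{pmatrix} 0 & -1 \\ -1 & 0 \end{pmatrix}, \quad \gamma^i = \begin{pmatrix} 0 & \sigma^i \\ -\sigma^i & 0 \end{pmatrix}, \quad \gamma^5 = \begin{pmatrix} -i & 0 \\ 0 & i \end{pmatrix}, \quad \sigma^{0i} = \begin{pmatrix} i\sigma^i & 0 \\ 0 & -i\sigma^i \end{pmatrix}, \quad \sigma^{ij} = \epsilon^{ijk}\begin{pmatrix} \sigma^k & 0 \\ 0 & \sigma^k \end{pmatrix}. \tag{14.41} $$

Obviously

$$ P_L = \frac{1}{2}(1 + i\gamma^5) = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix} \tag{14.42} $$

$$ \text{and} \quad P_R = \frac{1}{2}(1 - i\gamma^5) = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix} \tag{14.43} $$

$$ \text{so that} \quad \psi = \begin{pmatrix} \psi_L \\ \psi_R \end{pmatrix}. \tag{14.44} $$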

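We can then see that

$$ \begin{aligned} \psi_L &\to \psi_L + \frac{i\omega^{0i}}{2}(i\sigma^i)\,\psi_L - \frac{i\omega^{ij}}{4}\epsilon^{ijk}\sigma^k\psi_L \\ \psi_R &\to \psi_R + \frac{i\omega^{0i}}{2}(-i\sigma^i)\,\psi_R - \frac{i\omega^{ij}}{4}\epsilon^{ijk}\sigma^k\psi_R \end{aligned} \tag{14.45} $$

that is, they transform the same way under rotations, but not under boosts!
Notice the extra $i$ factor with $\omega^{0i}$: this is SO(1, 3) or SL(2, C), not SU(2) x
SU(2). The representations are finite dimensional only by being nonunitary;
this is a noncompact group.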

We are familiar with this through the use of the adjoint spinor

$$ \bar\psi^\alpha = (\psi_\beta)^\dagger (\gamma^0)_\beta{}^\alpha, \tag{14.46} $$

which transforms as

$$ \bar\psi'^\alpha = \bar\psi^\beta (S^{-1})_\beta{}^\alpha \tag{14.47} $$

so that $\bar\psi^\alpha\psi_\alpha$ is an invariant.
Notice that

$$ \bar\psi_L = \bar\psi P_R \tag{14.48} $$

so that $\bar\psi_L\psi_L = 0$ and the Weyl spinors are massless. Only when combined as
in the (reducible) Dirac spinor do terms involving $\bar\psi_L\psi_R$ and $\bar\psi_R\psi_L$ become
available as mass terms.


Charge Conjugation and Majorana Spinor 


Now we must briefly discuss charge conjugation and the reality properties 
of spinors. If we have a Dirac equation minimally coupled (gauge) to electro- 
magnetism we have 


(ig-4-m—e Ay =0 (14.49) 


and we can easily establish that if we define the charge conjugate by 


ye -Cy. (14.50) 
then we get 
(ig+m—e Ap =0 (14.51) 
provided that 
C(y^) C^ = (2v. (14.52) 


Actually we should play safe in doubt and use the full indices 


(Wea = Cap" (14.53) 
with | Cos(y ), (C )* = (2) (y). (14.54) 




At any rate we can establish that 


CT = -C 
(y°C)? =-y°C whereas (y“C)? 2 —y^C 
(oC)? = -oC (oC) 2 -e"C. (14.55) 
In our original representation we can take a unitary C with CT = -C = C! = 
C^! and C* =C as 
C= iy?2y9 
=o? (14.56) 
which looks like 
eig 
(imer (S. 2 ) (14.57) 


In the Weyl representation this becomes 


—io? 0 
cal 0 is?) 


0 -1 
1 0 
= oil (14.58) 
—-1 0 
We see that 
io? yk 
Wo = ; (14.59) 
—io?y; 


so that the Weyl spinors form a conjugate pair. However, (y.)c = We, so that 
reality can be imposed on the Dirac spinor. This is then known as a Majorana 


spinor. It then has the form 
VL 
VM = (14.60) 
-ic^y; 
and so (Ym): = Ym. 
Finally we return a few pages to rewrite the transformation as

$$ \psi_L \to \psi_L + i\omega^{0i}K^i\psi_L - \frac{i}{2}\omega^{ij}\epsilon^{ijk}J^k\psi_L \tag{14.61} $$

where $J^i = \frac{1}{2}\sigma^i$ and $K^i = \frac{i}{2}\sigma^i$,
and we see that

$$ \begin{aligned} [J^i, J^j] &= i\epsilon^{ijk}J^k \\ [J^i, K^j] &= i\epsilon^{ijk}K^k \\ [K^i, K^j] &= -i\epsilon^{ijk}J^k \end{aligned} \tag{14.62} $$

where it is the crucial minus sign in the last equation that shows we have
SL(2, C) ~ SO(1, 3) and not SU(2) x SU(2).
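The sign structure of this algebra is quick to confirm with 2 x 2 matrices. A minimal sketch, assuming the left-handed choice $J^i = \frac{1}{2}\sigma^i$, $K^i = \frac{i}{2}\sigma^i$; the helper `closes` is a name introduced here for illustration.

```python
import numpy as np

pauli = [np.array([[0, 1], [1, 0]], dtype=complex),
         np.array([[0, -1j], [1j, 0]], dtype=complex),
         np.array([[1, 0], [0, -1]], dtype=complex)]

J = [0.5 * s for s in pauli]    # rotation generators
K = [0.5j * s for s in pauli]   # boost generators (note the extra factor of i)


def eps(i, j, k):
    """Levi-Civita symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) / 2


def commutator(a, b):
    return a @ b - b @ a


def closes(A, B, C, sign):
    """Check [A^i, B^j] = sign * i * eps^{ijk} C^k for all i, j."""
    for i in range(3):
        for j in range(3):
            rhs = sum(sign * 1j * eps(i, j, k) * C[k] for k in range(3))
            if not np.allclose(commutator(A[i], B[j]), rhs):
                return False
    return True


print("[J, J] = +i eps J:", closes(J, J, J, +1))
print("[J, K] = +i eps K:", closes(J, K, K, +1))
print("[K, K] = -i eps J:", closes(K, K, J, -1))
```

With Hermitian boost generators the last commutator would carry a plus sign; the minus sign appears precisely because the $K^i$ here are antihermitian.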


A Notational Trick 


A fairly standard description has been presented up to this point. Now it 
seems not to be widely recognized that in cases where there is a complex pair 
of spinors (as we have) then y? can be taken along with C. More precisely, we 
shall switch our definition of C to a new one defined by 


iod 
0565 [9 0 
0 1 
—1 0 
= TESI (14.63) 
-1 0 
in both representations. Also note that our new C has the matrix elements 
of e^? = c? = 1 = —e?, This works whether we are working with the left 
spinor or the right one. Moreover, the Majorana spinor now has the form 
VL 
VM = (14.64) 
io?y; 


with the same matrix £^? form. 


SL(2, C) View

The literature switches between SO(1, 3) and SL(2, C). Here is an SL(2, C)
view


Yh = Mf yg oy y Ta = y^ M; (14.65) 
where * is the definition of inverse by "raising" index


Definition of 
complex dagger 
=> This is 
Raise and dot the Check 
i Mum d E EEEI 
hye Py * Lip Def^ , i à To! = ~ 
(w= Qr a) = (MP)? E yti — 91 = oio. 


(14.66) 


The dotted indices refer to the indices on the right-handed spinor; the undot- 
ted indices are on the left-handed spinor. The idea with a matrix group, given 
a representation y, is that other (possibly the same or related) representations 
are given by M* (the conjugate) and M^! (the contragredient). 

The important thing is to preserve the order 


M(61, 62) = M(01) M(0?). (14.67) 


Of course it is highly convenient to have a matrix notation related to indices— 
even though the primitive idea is invariance by saturation (summing) of in- 
dices over upper and lower values of the same type (here dotted or undotted). 
Really ý is the (contragredient) representation, but we use i)" for conve- 
nience. The index saturation and the matrix notation coexist happily, as long 
as we do not transpose at will. (Thus, for example, neither y” nor y have 
"indices" in the index saturation sense. There is no "raising" or "lowering"; 
there is no ^metric.") 

Note: In general the complex conjugate representation (and its contragredi- 
ent one) have no relationship (other than conjugation) with the original one. 
This is why we use dotted indices. The dot comes from conjugation; the extra 
transposition is for later convenience. 


Unitary Representations 


(The idea is that for the subset of unitary matrices M we can drop the dots.) 
Unitarity is a property that survives the group law, so we can ask if y trans- 
forms exactly as y7 under the subgroup—and up to a mixing of the compo- 
nents, of course. (This is why we threw a transposition into our definitions— 
for convenience at this point.) 


D ; 
So the question is: Is there a matrix A’ such that y^ a (^ A transforms 


as 3j"? This is precisely what we used y for in defining y. For the unitary 
subgroup (SU(2) rotations) the matrix A is effectively unity—so the bar (or 
adjoint) of the spinor (or fundamental) is just the T and wy is invariant. 
Notice that our new choice of C is effectively $i\sigma^2$ in the subgroups, so we
simply put


(io?) = e? = ey = 2? = ej = 1. (14.68) 


We can proceed with some identities, an example of which is e“e,, = ó7, 
and have nothing new to learn. However, we must be careful to "raise and 
lower" with matrix order in mind, for example, 


Ve = eV) V” > row 
Vg = ou V; => column (14.69) 
therefore V* = e% V, = ce? V” ep 
= Vra = V” (14.70) 


works out correctly. 


Supersymmetry: A First Look at the Simplest (N = 1) Case 


It is a fundamental symmetry relating bosons and fermions. It merges Poincaré
symmetry with internal symmetry in a way that does not violate the Coleman
and Mandula "no go" theorem [7]. The theorem says that (apart from
supersymmetry) we always get Poincaré ⊗ Internal, and that the internal is
compact and semisimple. It was probably first discovered by Gol'fand and
Likhtman [5], but really understood by Wess and Zumino [1,2,9].

In the simplest case we ask if we can extend the algebra of Poincaré ⊗ Internal
by introducing supercharges $Q_\alpha$ and $\bar{Q}_{\dot\alpha}$ which have the anticommutation
relations

$$ \{Q_\alpha, \bar{Q}_{\dot\beta}\} = 2(\sigma^\mu)_{\alpha\dot\beta} P_\mu \tag{14.71} $$

$$ \{Q_\alpha, Q_\beta\} = 0 \tag{14.72} $$

and are fermionic. Notice that the closure is only onto the translations, which
commute with themselves. This is essential, because the existence of a
consistent algebra needs all possible generalized Jacobi identities to hold, and lots
of zeros really help.
Here $\sigma^\mu = (1, \sigma^i)$ makes the connection back between SL(2, C) and SO(1, 3).
It turns out that we need simple commutators like

$$ [Q_\alpha, P^\mu] = 0 \tag{14.73} $$

and the spinor nature of the supercharges requires

$$ [Q_\alpha, M_{\mu\nu}] = \frac{1}{2}(\sigma_{\mu\nu})_\alpha{}^\beta Q_\beta. \tag{14.74} $$
The algebra is now closed. In this simple case, $N = 1$, there is just the one
spinor $Q_\alpha$ and so no internal symmetry exists to label states, but see later.
Whenever the $Q_\alpha$ have a nontrivial effect on a state we get a change from
fermion to boson, or vice versa. It is a theorem that any supermultiplet has equal
numbers of bosons and fermions.
If $N_F$ is the fermion number operator, then $(-1)^{N_F}$ has eigenvalue $+1$ for
bosons, and eigenvalue $-1$ for fermions. Thus

$$ (-1)^{N_F} Q_\alpha = -Q_\alpha (-1)^{N_F}. \tag{14.75} $$

Now

$$ \begin{aligned} \mathrm{Tr}[(-1)^{N_F}\{Q_\alpha, \bar{Q}_{\dot\beta}\}] &= \mathrm{Tr}[(-1)^{N_F}(Q_\alpha\bar{Q}_{\dot\beta} + \bar{Q}_{\dot\beta}Q_\alpha)] \\ &= \mathrm{Tr}[-Q_\alpha(-1)^{N_F}\bar{Q}_{\dot\beta} + Q_\alpha(-1)^{N_F}\bar{Q}_{\dot\beta}] \\ &= 0 \end{aligned} \tag{14.76} $$

since traces are cyclic for finite entries. But from the superalgebra we now
have

$$ 2(\sigma^\mu)_{\alpha\dot\beta}\,\mathrm{Tr}[(-1)^{N_F} P_\mu] = 0 \tag{14.77} $$

and $P_\mu$ is arbitrary so that

$$ \mathrm{Tr}[(-1)^{N_F}] = 0. \tag{14.78} $$

Thus there are equal numbers of fermions and bosons.

Actually this is stranger than it looks: it extends also from the on-mass-shell
case here to field representations having to have matching numbers of
components.


Massive Representations 


The complete commuting set of observables includes $P^\mu$, and if $p^2 = m^2$,
we go to the rest frame as usual. What new things commute with the $P^\mu$?
Obviously the supercharges.

We define

$$ a_\alpha = \frac{1}{\sqrt{2m}} Q_\alpha \tag{14.79} $$

$$ \text{with} \quad (a_\alpha)^\dagger = \frac{1}{\sqrt{2m}} \bar{Q}_{\dot\alpha}. \tag{14.80} $$

Then

$$ \{a_\alpha, (a_\beta)^\dagger\} = \delta_\alpha^\beta \tag{14.81} $$

$$ \text{and} \quad \{a_\alpha, a_\beta\} = 0 = \{(a_\alpha)^\dagger, (a_\beta)^\dagger\} \tag{14.82} $$

where dots are dropped since we only have rotations in the complete
commuting set of observables now.
We define a Clifford vacuum by

$$ a_\alpha \Omega = 0 \tag{14.83} $$

noting that $P^2\Omega = m^2\Omega$ is not zero. The states arise by creating with $(a_\alpha)^\dagger$ on
$\Omega$. This soon stops because the $(a_\alpha)^\dagger$ anticommute.

All we get is 
Q 
(aa) R 
JL ar = ecc iita tol 
ge (ap)! = — e a) 05. (14.84) 


These are spin zero, spin 1/2, and spin zero again. This is the fundamental
irreducible multiplet. Note the match of spin zero bosons with the spin 1/2 fermions.
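The counting in this multiplet can be made concrete with explicit matrices. The sketch below realizes two fermionic oscillators on a four-dimensional Fock space by a Jordan-Wigner-type construction (a choice made for this illustration, not taken from the text) and checks the oscillator algebra together with $\mathrm{Tr}[(-1)^{N_F}] = 0$.

```python
import numpy as np

a = np.array([[0, 1], [0, 0]], dtype=complex)   # single-mode annihilator
s3 = np.diag([1.0, -1.0]).astype(complex)       # (-1)^N for a single mode
I2 = np.eye(2, dtype=complex)
I4 = np.eye(4, dtype=complex)

# Two anticommuting modes (assumed Jordan-Wigner form) and fermion parity.
a1 = np.kron(a, I2)
a2 = np.kron(s3, a)
parity = np.kron(s3, s3)


def anticommutator(x, y):
    return x @ y + y @ x


def dagger(x):
    return x.conj().T


def oscillator_algebra_holds():
    """{a_i, a_j^dagger} = delta_ij and {a_i, a_j} = 0."""
    ops = [a1, a2]
    for i in range(2):
        for j in range(2):
            delta = I4 if i == j else np.zeros((4, 4))
            if not np.allclose(anticommutator(ops[i], dagger(ops[j])), delta):
                return False
            if not np.allclose(anticommutator(ops[i], ops[j]), 0):
                return False
    return True


# Clifford vacuum and the states created from it.
vacuum = np.zeros(4, dtype=complex)
vacuum[0] = 1.0
states = {
    "Omega": vacuum,
    "a1^dag Omega": dagger(a1) @ vacuum,
    "a2^dag Omega": dagger(a2) @ vacuum,
    "a1^dag a2^dag Omega": dagger(a1) @ dagger(a2) @ vacuum,
}
parities = {name: float(np.real(np.vdot(v, parity @ v)))
            for name, v in states.items()}
for name, value in parities.items():
    print(f"{name}: (-1)^N_F = {value:+.0f}")
```

Two states come out bosonic and two fermionic, so the trace of $(-1)^{N_F}$ over the multiplet vanishes.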

We can also start with a vacuum $\Omega_j$ with $(2j + 1)$ components of spin $j$. By
addition of angular momentum we get

$$ j \otimes [0 \oplus \tfrac{1}{2} \oplus 0] = j \oplus (j + \tfrac{1}{2}) \oplus (j - \tfrac{1}{2}) \oplus j \tag{14.85} $$

as the multiplet. Figure 14.1 gives the multiplicities in tabular form.
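The multiplicities follow mechanically from this decomposition. The sketch below tabulates the spin content for a vacuum of spin $j$ and counts bosonic and fermionic components; `multiplet` and `degrees_of_freedom` are illustrative names, and the $j - \frac{1}{2}$ piece is dropped for $j = 0$.

```python
from collections import Counter
from fractions import Fraction

HALF = Fraction(1, 2)


def multiplet(j):
    """Spin content of j x [0 + 1/2 + 0] as a Counter {spin: multiplicity}."""
    j = Fraction(j)
    content = Counter({j: 2})        # one copy of spin j from each spin-0 piece
    content[j + HALF] += 1           # from the spin-1/2 piece
    if j > 0:
        content[j - HALF] += 1       # absent when the vacuum has spin 0
    return content


def degrees_of_freedom(content):
    """Return (bosonic, fermionic) counts of (2s + 1) components."""
    bosons = sum(n * (2 * s + 1) for s, n in content.items()
                 if s.denominator == 1)
    fermions = sum(n * (2 * s + 1) for s, n in content.items()
                   if s.denominator == 2)
    return int(bosons), int(fermions)


for j in (0, HALF, 1, Fraction(3, 2)):
    content = multiplet(j)
    row = ", ".join(f"spin {s}: {n}" for s, n in sorted(content.items()))
    print(f"vacuum spin {Fraction(j)}: {row}; (bosons, fermions) = "
          f"{degrees_of_freedom(content)}")
```

For every vacuum spin the bosonic and fermionic component counts agree, as the trace argument requires.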

The Clifford algebra structure reveals that we have an SO(4) invariance
group, which is isomorphic to SU(2) x SU(2). For more $Q$'s ($N$ replaces 1
here) we find SO(4N) ⊃ SU(2) x USp(2N), giving an "internal" symplectic
2N structure along with the SU(2) spin.


Spin  |  Ω_0  |  Ω_{1/2}  |  Ω_1  |  Ω_{3/2}

0     |   2   |     1     |       |
1/2   |   1   |     2     |   1   |
1     |       |     1     |   2   |    1
3/2   |       |           |   1   |    2
2     |       |           |       |    1


FIGURE 14.1

The four vacuum spin components Ω_0, Ω_{1/2}, Ω_1, Ω_{3/2} plotted against the spins 0, 1/2, 1, 3/2, 2.
Massless Representations


Now $p^2 = 0$, and we go to the $(E, 0, 0, E)$ frame and the Euclidean little group
with helicity. Now the algebra of supercharges reads:

$$ \{Q_\alpha, \bar{Q}_{\dot\beta}\} = 2\begin{pmatrix} 2E & 0 \\ 0 & 0 \end{pmatrix} \tag{14.86} $$

together with

$$ \{Q_\alpha, Q_\beta\} = 0 = \{\bar{Q}_{\dot\alpha}, \bar{Q}_{\dot\beta}\}. \tag{14.87} $$

Defining

$$ a = \frac{1}{2\sqrt{E}} Q_1, \qquad a^\dagger = \frac{1}{2\sqrt{E}} \bar{Q}_{\dot 1} \tag{14.88} $$

we have

$$ \{a, a^\dagger\} = 1, \qquad \{a, a\} = 0 = \{a^\dagger, a^\dagger\}. \tag{14.89} $$

But $Q_2$ and $\bar{Q}_{\dot 2}$ are totally anticommuting and must be represented by zero.
We have a vacuum $\Omega_\lambda$ of lowest helicity $\lambda$, abbreviated to "hel" in the table
(see Figure 14.2), which is annihilated by $a$ (i.e., $a\Omega_\lambda = 0$). Then $a^\dagger\Omega_\lambda$ raises
the helicity by 1/2 but can only be used once. The massive representation splits
into two massless ones. In CPT invariant schemes we usually need to double
up all this. (But the cases for higher N sometimes are self-complete.)


λ  |  -2  |  -3/2  |  -1  |  -1/2  |  0  |  +1/2  |  +1  |  +3/2


FIGURE 14.2

The helicity pairs from (-2, -3/2) to (3/2, 2) plotted against the lowest helicity ("hel" in the table)
of the vacuum $\Omega_\lambda$.
Superspace


I am going to introduce some new techniques that will make calculations 
easier. They are based on the idea of extending space-time into dimensions 
described by anticommuting coordinates. Just what this space is precisely 
does not seem to be well understood, although the formal manipulations 
seem to be consistent. To establish contact with the previous notation on 
Poincaré, and to provide a familiar example for reference, we shall have a 
review section. 


Three-Dimensional Euclidean Space (Revisited) 


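The elements of the Hilbert space of quantum field theory are generated by the
action of field-valued operators $\Phi(\vec{x})$ on a translationally invariant vacuum:

$$ |\vec{x}\rangle = \Phi(\vec{x})|0\rangle, \qquad |\vec{x}, \vec{x}'\rangle = \Phi(\vec{x})\Phi(\vec{x}')|0\rangle \tag{14.90} $$

and translations of a state are generated by the energy-momentum operator:

$$ |\vec{x} + \vec{a}, \vec{x}' + \vec{a}\rangle = \exp(-i\vec{a}\cdot\vec{P})\,|\vec{x}, \vec{x}'\rangle, \tag{14.91} $$

so that

$$ \Phi(\vec{x} + \vec{a}) = e^{-i\vec{a}\cdot\vec{P}}\,\Phi(\vec{x})\,e^{i\vec{a}\cdot\vec{P}} = \Phi(\vec{x}) - i[\vec{a}\cdot\vec{P}, \Phi] + \cdots \tag{14.92} $$

$$ \text{and if} \quad [\Phi, \vec{P}] = -i\vec\nabla\Phi \tag{14.93} $$

$$ \begin{aligned} \text{then} \quad \delta\Phi &= \Phi'(\vec{x}) - \Phi(\vec{x}) \\ &= \Phi(\vec{x} - \vec{a}) - \Phi(\vec{x}) \\ &= -i[\Phi, \vec{a}\cdot\vec{P}] \\ &= (-i\vec{a})\cdot(-i\vec\nabla\Phi). \end{aligned} \tag{14.94} $$

In fact, using

$$ e^{i\vec{a}\cdot\vec{P}}\,\Phi\,e^{-i\vec{a}\cdot\vec{P}} = \sum_n \frac{i^n}{n!}\,[\vec{a}\cdot\vec{P}, [\vec{a}\cdot\vec{P}, [\cdots[\vec{a}\cdot\vec{P}, \Phi]\cdots]]], \tag{14.95} $$

we have

$$ \Phi'(\vec{x}) = \Phi(\vec{x} - \vec{a}) = \{\exp[-i\vec{a}\cdot(-i\vec\nabla)]\}\,\Phi. \tag{14.96} $$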

Now, suppose we start (as we shall in the supersymmetry case) simply with
the algebra of the Euclidean group in three dimensions:

$$ \begin{aligned} [M_{ij}, M_{kl}] &= i(\delta_{ik}M_{jl} - \delta_{il}M_{jk} + \delta_{jl}M_{ik} - \delta_{jk}M_{il}) \\ [M_{ij}, P_k] &= i\delta_{ik}P_j - i\delta_{jk}P_i \\ [P_i, P_j] &= 0 \end{aligned} \tag{14.97} $$

where the first line is the algebra of the SO(3) rotation subgroup. A group
element can be specified by

$$ g = \exp\{-i(a_i P_i + \tfrac{1}{2}\omega_{ij}M_{ij})\} \tag{14.98} $$

where $a_i$ and $\omega_{ij}$ are the (real) parameters. (There are equivalent alternatives,
e.g., $\exp(-i\vec{a}\cdot\vec{P})\exp(-\tfrac{i}{2}\omega_{ij}M_{ij})$, but this will not concern us.) If we define

$$ L_i = \tfrac{1}{2}\epsilon_{ijk}M_{jk}, \qquad M_{ij} = \epsilon_{ijk}L_k \tag{14.99} $$

$$ \begin{aligned} \text{then} \quad [L_i, L_j] &= i\epsilon_{ijk}L_k \\ [L_i, P_j] &= i\epsilon_{ijk}P_k \\ [P_i, P_j] &= 0 \end{aligned} \tag{14.100} $$

is an equivalent form of the algebra, and

$$ g = \exp\{-i(a_i P_i + \theta_i L_i)\} \tag{14.101} $$

where $\theta_i = \tfrac{1}{2}\epsilon_{ijk}\omega_{jk}$ are the parameters now.

Now consider the three-dimensional coset space, which is the quotient of
the Euclidean group by the SO(3) rotation subgroup. We pick an origin in this
space and put coordinates on its neighborhood by exponentiating the tangent
space at that point; that is, we write a point in the coset space as

$$ M(\vec{x}) = \exp(-ix_i P_i) \tag{14.102} $$

where $x_i$ are the coordinates. The group action is defined by left multiplication
so that

$$ g\exp(-ix_i P_i) = \exp(-ix'_i P_i)\exp(-i\eta_j L_j) \tag{14.103} $$

where $\eta_j$ is whatever it has to be in terms of the coordinates and parameters,
and $x'_i$ are the coordinates of the transformed point.
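The notation is general enough to handle more complicated problems later.
The general theory [10] assures us that $x_i \to x'_i$ is a representation of the full
group and is linear on the subgroup. In our simple case:

1. For translations where

$$ g = \exp(-ia_i P_i) \tag{14.104} $$

$$ \text{then} \quad g\exp(-ix_i P_i) = \exp(-i[x_i + a_i]P_i) \tag{14.105} $$

$$ \text{and thus} \quad x'_i = x_i + a_i. \tag{14.106} $$

2. Again, for rotations, where

$$ g = \exp(-i\theta_j L_j) \tag{14.107} $$

$$ \begin{aligned} \text{then} \quad g\exp(-ix_i P_i) &= \exp(-i\vec\theta\cdot\vec{L})\exp(-i\vec{x}\cdot\vec{P})\exp(+i\vec\theta\cdot\vec{L}) \times \exp(-i\vec\theta\cdot\vec{L}) \\ &= \exp\big(-ix_i[\exp(-i\vec\theta\cdot\vec{L})\,P_i\exp(+i\vec\theta\cdot\vec{L})]\big) \times \exp(-i\vec\theta\cdot\vec{L}) \\ &= \exp\Big\{-ix_i\sum_n \frac{(-i)^n}{n!}[\vec\theta\cdot\vec{L}, [\cdots[\vec\theta\cdot\vec{L}, P_i]\cdots]]\Big\}\exp(-i\vec\theta\cdot\vec{L}) \\ &= \exp\{-ix_i(P_i - i\theta_j[L_j, P_i] + \cdots)\}\exp(-i\vec\theta\cdot\vec{L}) \\ &= \exp\{-i(x_i + \epsilon_{ijk}\theta_j x_k + \cdots)P_i\}\exp(-i\vec\theta\cdot\vec{L}). \end{aligned} \tag{14.108} $$

$$ \text{Hence} \quad x'_i = x_i + \epsilon_{ijk}\theta_j x_k + \cdots. \tag{14.109} $$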
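The coset construction can be tried out with explicit matrices. The sketch below uses a 4 x 4 affine representation of the Euclidean group acting on column vectors $(\vec{x}, 1)$, with $(L_i)_{jk} = -i\epsilon_{ijk}$ and $P_i$ having the single nonzero entry $i$ in row $i$ of the last column; this representation and the truncated-series exponential are assumptions of the example.

```python
import numpy as np


def levi_civita(i, j, k):
    """Levi-Civita symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) / 2


def L(i):
    """Rotation generator in the assumed 4x4 affine representation."""
    m = np.zeros((4, 4), dtype=complex)
    for j in range(3):
        for k in range(3):
            m[j, k] = -1j * levi_civita(i, j, k)
    return m


def P(i):
    """Translation generator in the assumed 4x4 affine representation."""
    m = np.zeros((4, 4), dtype=complex)
    m[i, 3] = 1j
    return m


def expm(a, terms=40):
    """Matrix exponential by truncated Taylor series (adequate for small norms)."""
    result = np.eye(a.shape[0], dtype=complex)
    term = np.eye(a.shape[0], dtype=complex)
    for n in range(1, terms):
        term = term @ a / n
        result = result + term
    return result


def coset_point(x):
    """exp(-i x_i P_i): the coset representative labelled by x."""
    return expm(-1j * sum(x[i] * P(i) for i in range(3)))


def rotation(theta):
    """exp(-i theta_i L_i)."""
    return expm(-1j * sum(theta[i] * L(i) for i in range(3)))


def transformed_point(theta, x):
    """Read x' off g exp(-i x.P) = exp(-i x'.P) exp(-i theta.L)."""
    return (rotation(theta) @ coset_point(x))[:3, 3].real


x = np.array([1.0, 2.0, 3.0])
theta = np.array([1e-4, -2e-4, 3e-4])
x_new = transformed_point(theta, x)
print("x' - x           :", x_new - x)
print("theta cross x    :", np.cross(theta, x))
```

For small $\vec\theta$ the shift agrees with $\epsilon_{ijk}\theta_j x_k$, and the factorized form of the product holds exactly in this representation.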
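We have rediscovered the familiar translation and rotation induced
transformations of the coordinates by the coset space method. Now we will use
the idea of scalar fields to rediscover the representations of the generators of
the algebra in terms of the coordinates and their derivatives. A scalar field
has the property

$$ \phi'(x') = \phi(x) \tag{14.110} $$

$$ \text{or} \quad g\phi(x) = \phi(g^{-1}x). \tag{14.111} $$

1. So if

$$ g = \exp(-ia_i P_i) \tag{14.112} $$

$$ \begin{aligned} \text{then} \quad \phi' &= \phi(x_i - a_i) \\ &= \phi(x) - ia_i(-i\partial_i)\phi + \cdots \\ &= \exp\{-ia_i(-i\partial_i)\}\,\phi \\ [\text{or}\ \delta\phi &= \phi' - \phi = -ia_i(-i\partial_i)\phi\,] \end{aligned} \tag{14.113} $$

$$ \text{and we conclude} \quad \vec{P} \to -i\vec\nabla. \tag{14.114} $$

2. Similarly, if

$$ g = \exp(-i\vec\theta\cdot\vec{L}) \tag{14.115} $$

$$ \begin{aligned} \text{then} \quad \phi' &= \phi(x_i - \epsilon_{ijk}\theta_j x_k + \cdots) \\ &= \phi - i\theta_i\,\epsilon_{ijk}x_j(-i\partial_k)\phi + \cdots \end{aligned} \tag{14.116} $$

and we learn that

$$ L_i \to \epsilon_{ijk}x_j(-i\partial_k) = (\vec{x}\wedge\vec{P})_i. \tag{14.117} $$

$$ \text{Notice that} \quad M_{ij} \to \epsilon_{ijk}L_k = -i(x_i\partial_j - x_j\partial_i). \tag{14.118} $$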
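A symbolic check that these differential operators really satisfy the Euclidean algebra is straightforward. This is a sketch using SymPy, assuming $P_i = -i\partial_i$ and $L_i = \epsilon_{ijk}x_j(-i\partial_k)$ acting on an arbitrary test function `f`; the helper names are illustrative.

```python
import sympy as sp

x = sp.symbols("x1 x2 x3", real=True)
f = sp.Function("f")(*x)          # arbitrary test function


def P(i, expr):
    """P_i = -i d/dx_i acting on expr."""
    return -sp.I * sp.diff(expr, x[i])


def L(i, expr):
    """L_i = eps_{ijk} x_j (-i d/dx_k) acting on expr."""
    return sum(sp.LeviCivita(i, j, k) * x[j] * P(k, expr)
               for j in range(3) for k in range(3))


def commutator(A, B, expr):
    return sp.expand(A(B(expr)) - B(A(expr)))


def check_LL(i, j):
    """[L_i, L_j] f = i eps_{ijk} L_k f."""
    lhs = commutator(lambda e: L(i, e), lambda e: L(j, e), f)
    rhs = sum(sp.I * sp.LeviCivita(i, j, k) * L(k, f) for k in range(3))
    return sp.simplify(lhs - sp.expand(rhs)) == 0


def check_LP(i, j):
    """[L_i, P_j] f = i eps_{ijk} P_k f."""
    lhs = commutator(lambda e: L(i, e), lambda e: P(j, e), f)
    rhs = sum(sp.I * sp.LeviCivita(i, j, k) * P(k, f) for k in range(3))
    return sp.simplify(lhs - sp.expand(rhs)) == 0


def check_M(i, j):
    """eps_{ijk} L_k f = -i (x_i d_j - x_j d_i) f."""
    lhs = sum(sp.LeviCivita(i, j, k) * L(k, f) for k in range(3))
    rhs = -sp.I * (x[i] * sp.diff(f, x[j]) - x[j] * sp.diff(f, x[i]))
    return sp.simplify(sp.expand(lhs) - sp.expand(rhs)) == 0


pairs = [(i, j) for i in range(3) for j in range(3)]
print("[L, L] closes:", all(check_LL(i, j) for i, j in pairs))
print("[L, P] closes:", all(check_LP(i, j) for i, j in pairs))
```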


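With these exercises under our belts, and the notation established, we turn
to N = 1 superspace. This time we start with the graded algebra, whose first
three lines are the Poincaré algebra and whose first line is that of SO(1, 3):

$$ \begin{aligned} [M_{\mu\nu}, M_{\rho\sigma}] &= ig_{\nu\rho}M_{\mu\sigma} + ig_{\mu\sigma}M_{\nu\rho} - ig_{\mu\rho}M_{\nu\sigma} - ig_{\nu\sigma}M_{\mu\rho} \\ [M_{\mu\nu}, P_\rho] &= -ig_{\mu\rho}P_\nu + ig_{\nu\rho}P_\mu \\ [P_\mu, P_\nu] &= 0 \end{aligned} \tag{14.119} $$

$$ \begin{aligned} [Q_\alpha, P_\mu] &= 0 \\ [Q_\alpha, M_{\mu\nu}] &= \tfrac{1}{2}(\sigma_{\mu\nu})_\alpha{}^\beta Q_\beta \\ [\bar{Q}_{\dot\alpha}, M_{\mu\nu}] &= -\tfrac{1}{2}\bar{Q}_{\dot\beta}(\bar\sigma_{\mu\nu})^{\dot\beta}{}_{\dot\alpha} \quad \text{with} \quad \bar\sigma_{\mu\nu} = (\sigma_{\mu\nu})^\dagger \end{aligned} \tag{14.120} $$

$$ \begin{aligned} \{Q_\alpha, \bar{Q}_{\dot\beta}\} &= 2(\sigma^\mu)_{\alpha\dot\beta}P_\mu \\ \{Q_\alpha, Q_\beta\} &= 0. \end{aligned} \tag{14.121} $$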
In order to be able to copy the formalism we used in the previous case we
introduce $\xi^\alpha$ and $\bar\xi^{\dot\alpha} = (\xi^\alpha)^*$ as anticommuting (constant) parameters. The
idea is that commutators of objects like $(\xi^\alpha Q_\alpha + \bar{Q}_{\dot\alpha}\bar\xi^{\dot\alpha})$ will then be specified by
anticommutators of the $Q_\alpha$ and $\bar{Q}_{\dot\alpha}$, which are given in the algebra. Thus we
can obtain group elements by exponentiation and work out products exactly
as before. Explicitly, we have

$$ \begin{aligned} [\xi^\alpha Q_\alpha + \bar{Q}_{\dot\alpha}\bar\xi^{\dot\alpha},\ \eta^\beta Q_\beta + \bar{Q}_{\dot\beta}\bar\eta^{\dot\beta}] &= -\xi^\alpha\eta^\beta\{Q_\alpha, Q_\beta\} - \bar\xi^{\dot\alpha}\bar\eta^{\dot\beta}\{\bar{Q}_{\dot\alpha}, \bar{Q}_{\dot\beta}\} \\ &\quad + \xi^\alpha\bar\eta^{\dot\beta}\{Q_\alpha, \bar{Q}_{\dot\beta}\} + \bar\xi^{\dot\alpha}\eta^\beta\{\bar{Q}_{\dot\alpha}, Q_\beta\} \\ &= 2(\xi^\alpha\bar\eta^{\dot\beta} + \bar\xi^{\dot\beta}\eta^\alpha)(\sigma^\mu)_{\alpha\dot\beta}P_\mu, \end{aligned} \tag{14.122} $$

which will be used shortly. Notice that if we want objects like $(\xi^\alpha Q_\alpha + \bar{Q}_{\dot\alpha}\bar\xi^{\dot\alpha})$ to
be Hermitian then we must adopt the convention that conjugation reverses the
order of "fermionic" (i.e., odd Grassmann) factors without change of sign. Notice also
that the coefficient of $P_\mu$ in the last equation is an even element of a Grassmann
algebra rather than an ordinary number. Thus the group parameters must
share this property. This is a little disturbing, particularly as we are now
going to parametrize a coset space to produce coordinates in superspace:
our "ordinary" $x^\mu$ coordinates would seem to take values in the even part of
the Grassmann algebra. We shall nevertheless differentiate without worrying
further, since all these formal manipulations seem to work out in a consistent
manner.

Quite explicitly then we introduce anticommuting coordinates $\theta^\alpha$ and $\bar\theta^{\dot\alpha} =
(\theta^\alpha)^*$, and extend our ordinary space-time described by $x^\mu$ to superspace
described by $z^A = (x^\mu; \theta^\alpha, \bar\theta^{\dot\alpha})$, by parametrizing the quotient space of the
super Poincaré group by its SO(1, 3) Lorentz subgroup as

$$ M = \exp i(x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar{Q}_{\dot\alpha}\bar\theta^{\dot\alpha}) \tag{14.123} $$

and then copy what we did in our example.


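1. For space-time translations, when

$$ g = \exp(ia^\mu P_\mu) \tag{14.124} $$

$$ \text{then} \quad gM = \exp i[(x^\mu + a^\mu)P_\mu - \theta^\alpha Q_\alpha - \bar{Q}_{\dot\alpha}\bar\theta^{\dot\alpha}] \tag{14.125} $$

$$ \text{and thus} \quad x'^\mu = x^\mu + a^\mu, \qquad \theta'^\alpha = \theta^\alpha, \qquad \bar\theta'^{\dot\alpha} = \bar\theta^{\dot\alpha}. \tag{14.126} $$

2. For Lorentz boosts and rotations, when

$$ g = \exp\left(-\tfrac{i}{2}\omega^{\mu\nu}M_{\mu\nu}\right) \tag{14.127} $$

$$ \text{then} \quad gM = \left[\exp\left(-\tfrac{i}{2}\omega\cdot M\right)\exp i(x\cdot P - \theta Q - \bar{Q}\bar\theta)\exp\left(\tfrac{i}{2}\omega\cdot M\right)\right]\exp\left(-\tfrac{i}{2}\omega\cdot M\right) \tag{14.128} $$

so that the factor in square brackets is

$$ \begin{aligned} &\exp i\Big\{x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar{Q}_{\dot\alpha}\bar\theta^{\dot\alpha} - \tfrac{i}{2}\omega^{\rho\sigma}[M_{\rho\sigma},\ x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar{Q}_{\dot\alpha}\bar\theta^{\dot\alpha}] + \cdots\Big\} \\ &= \exp i\Big\{x^\mu[P_\mu - \tfrac{i}{2}\omega^{\rho\sigma}(-ig_{\rho\mu}P_\sigma + ig_{\sigma\mu}P_\rho)] - \theta^\alpha[Q_\alpha + \tfrac{i}{4}\omega^{\mu\nu}(\sigma_{\mu\nu})_\alpha{}^\beta Q_\beta] \\ &\qquad\qquad - [\bar{Q}_{\dot\alpha} - \tfrac{i}{4}\omega^{\mu\nu}\bar{Q}_{\dot\beta}(\bar\sigma_{\mu\nu})^{\dot\beta}{}_{\dot\alpha}]\bar\theta^{\dot\alpha}\Big\} \\ &= \exp i\Big\{(x^\mu + \omega^\mu{}_\nu x^\nu)P_\mu - [\theta^\alpha + \tfrac{i}{4}\omega^{\mu\nu}\theta^\beta(\sigma_{\mu\nu})_\beta{}^\alpha]Q_\alpha - \bar{Q}_{\dot\alpha}[\bar\theta^{\dot\alpha} - \tfrac{i}{4}\omega^{\mu\nu}(\bar\sigma_{\mu\nu})^{\dot\alpha}{}_{\dot\beta}\bar\theta^{\dot\beta}]\Big\} \end{aligned} \tag{14.129} $$

and hence

$$ x'^\mu = x^\mu + \omega^\mu{}_\nu x^\nu, \qquad \theta'^\alpha = \theta^\alpha + \tfrac{i}{4}\omega^{\mu\nu}\theta^\beta(\sigma_{\mu\nu})_\beta{}^\alpha, \qquad \bar\theta'^{\dot\alpha} = \bar\theta^{\dot\alpha} - \tfrac{i}{4}\omega^{\mu\nu}(\bar\sigma_{\mu\nu})^{\dot\alpha}{}_{\dot\beta}\bar\theta^{\dot\beta}. \tag{14.130} $$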

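3. Finally, if we have a pure super-transformation so that

$$ g = \exp\{-i(\xi^\alpha Q_\alpha + \bar{Q}_{\dot\alpha}\bar\xi^{\dot\alpha})\} \tag{14.131} $$

$$ \text{then} \quad gM = \exp\{-i[\xi^\alpha Q_\alpha + \bar{Q}_{\dot\alpha}\bar\xi^{\dot\alpha}]\}\exp\{i[x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar{Q}_{\dot\alpha}\bar\theta^{\dot\alpha}]\} \tag{14.132} $$

and we have to work this out explicitly using the Baker-Campbell-Hausdorff
(BCH) identity

$$ e^A e^B = \exp\left(A + B + \tfrac{1}{2}[A, B] + c_1[A, [A, B]] + c_2[B, [A, B]] + \cdots\right). \tag{14.133} $$

Fortunately only the first nontrivial term is needed (i.e., the terms with
coefficients $c_1$, $c_2$, etc. are not needed). This is because the only nontrivial
commutator involved gives $P_\mu$, which then promptly commutes to zero with all
the other $P_\mu$, $Q_\alpha$, and $\bar{Q}_{\dot\alpha}$. Explicitly we get

$$ \begin{aligned} gM &= \exp i\Big\{x^\mu P_\mu - (\theta^\alpha + \xi^\alpha)Q_\alpha - \bar{Q}_{\dot\alpha}(\bar\theta^{\dot\alpha} + \bar\xi^{\dot\alpha}) - \tfrac{i}{2}[\xi^\alpha Q_\alpha + \bar{Q}_{\dot\alpha}\bar\xi^{\dot\alpha},\ x^\mu P_\mu - \theta^\beta Q_\beta - \bar{Q}_{\dot\beta}\bar\theta^{\dot\beta}]\Big\} \\ &= \exp i\Big\{x^\mu P_\mu - (\theta^\alpha + \xi^\alpha)Q_\alpha - \bar{Q}_{\dot\alpha}(\bar\theta^{\dot\alpha} + \bar\xi^{\dot\alpha}) + \tfrac{i}{2}(\xi^\alpha\bar\theta^{\dot\beta} + \bar\xi^{\dot\beta}\theta^\alpha)\,2(\sigma^\mu)_{\alpha\dot\beta}P_\mu\Big\} \\ &= \exp i\Big\{[x^\mu + i(\xi^\alpha\bar\theta^{\dot\beta} + \bar\xi^{\dot\beta}\theta^\alpha)(\sigma^\mu)_{\alpha\dot\beta}]P_\mu - (\theta^\alpha + \xi^\alpha)Q_\alpha - \bar{Q}_{\dot\alpha}(\bar\theta^{\dot\alpha} + \bar\xi^{\dot\alpha})\Big\}. \end{aligned} \tag{14.134} $$

Hence

$$ \begin{aligned} x'^\mu &= x^\mu + i(\xi^\alpha\bar\theta^{\dot\beta} + \bar\xi^{\dot\beta}\theta^\alpha)(\sigma^\mu)_{\alpha\dot\beta} \\ \theta'^\alpha &= \theta^\alpha + \xi^\alpha \\ \bar\theta'^{\dot\alpha} &= \bar\theta^{\dot\alpha} + \bar\xi^{\dot\alpha} \end{aligned} \tag{14.135} $$

and we can see why people refer to these as supertranslations.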

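Now that we have our "coordinate transformations" we can use the idea
of scalar field to derive the representations of our generators in terms of the
coordinates. Taking the group elements in the same order as previously:

1. For translations,

$$ g = \exp(ia^\mu P_\mu) \tag{14.136} $$

$$ \phi'(x, \theta, \bar\theta) = \phi(x^\mu - a^\mu, \theta, \bar\theta) \tag{14.137} $$

$$ = \phi + ia^\mu(i\partial_\mu)\phi + \cdots. \tag{14.138} $$

$$ \text{So,} \quad P_\mu \to i\partial_\mu. \tag{14.139} $$

2. For Lorentz transformations,

$$ g = \exp\left(-\tfrac{i}{2}\omega^{\mu\nu}M_{\mu\nu}\right) \tag{14.140} $$

$$ \phi'(x, \theta, \bar\theta) = \phi\left(x^\mu - \omega^\mu{}_\nu x^\nu,\ \theta^\alpha - \tfrac{i}{4}\omega^{\mu\nu}\theta^\beta(\sigma_{\mu\nu})_\beta{}^\alpha,\ \bar\theta^{\dot\alpha} + \tfrac{i}{4}\omega^{\mu\nu}(\bar\sigma_{\mu\nu})^{\dot\alpha}{}_{\dot\beta}\bar\theta^{\dot\beta}\right) \tag{14.141} $$

$$ = \phi - \tfrac{i}{2}\omega^{\mu\nu}\left[i(x_\mu\partial_\nu - x_\nu\partial_\mu) + \tfrac{1}{2}\theta^\beta(\sigma_{\mu\nu})_\beta{}^\alpha\partial_\alpha - \tfrac{1}{2}(\bar\sigma_{\mu\nu})^{\dot\alpha}{}_{\dot\beta}\bar\theta^{\dot\beta}\bar\partial_{\dot\alpha}\right]\phi + \cdots \tag{14.142} $$

where we have introduced

$$ \partial_\alpha = \frac{\partial}{\partial\theta^\alpha} \quad \text{and} \quad \bar\partial_{\dot\alpha} = \frac{\partial}{\partial\bar\theta^{\dot\alpha}} \tag{14.143} $$

so defined such that

$$ \partial_\alpha\theta^\beta = \delta_\alpha^\beta \quad \text{and} \quad \bar\partial_{\dot\alpha}\bar\theta^{\dot\beta} = \delta_{\dot\alpha}^{\dot\beta}. \tag{14.144} $$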

(Note that dg, and 9, anticommute with other Fermi-like objects. Also, 
don't confuse 


(14.145) 


and similarly 
0; with Ox. (14.146) 


In practice this "confusion" does not seem to become a major problem.) 
We find, therefore, 


1 1 x 
My, = (x49, — 13535) + 5 (0,9) 0 da — 5 (6.8 da. (14.147) 
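3. Finally, if

$$g = \exp\left[-i\left(\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}\right)\right] \tag{14.148}$$

$$\phi' = \phi\left[x^\mu - i\left(\xi^\alpha\bar\theta^{\dot\alpha} + \bar\xi^{\dot\alpha}\theta^\alpha\right)(\sigma^\mu)_{\alpha\dot\alpha},\; \theta^\alpha - \xi^\alpha,\; \bar\theta^{\dot\alpha} - \bar\xi^{\dot\alpha}\right] \tag{14.149}$$

$$= \phi - i\left\{\xi^\alpha\left(-i\partial_\alpha + \bar\theta^{\dot\alpha}(\sigma^\mu)_{\alpha\dot\alpha}\partial_\mu\right) + \left(i\bar\partial_{\dot\alpha} - \theta^\alpha(\sigma^\mu)_{\alpha\dot\alpha}\partial_\mu\right)\bar\xi^{\dot\alpha}\right\}\phi + \cdots. \tag{14.150}$$

(Watch the anticommutator signs; also $\xi^\alpha$ are constant here.)

Hence

$$Q_\alpha = -i\partial_\alpha + \bar\theta^{\dot\alpha}(\sigma^\mu)_{\alpha\dot\alpha}\partial_\mu = -i\partial_\alpha + \bar\theta^{\dot\alpha}\partial_{\alpha\dot\alpha} \tag{14.151}$$

$$\bar Q_{\dot\alpha} = i\bar\partial_{\dot\alpha} - \theta^\alpha(\sigma^\mu)_{\alpha\dot\alpha}\partial_\mu = i\bar\partial_{\dot\alpha} - \theta^\alpha\partial_{\alpha\dot\alpha}. \tag{14.152}$$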


Covariant Derivative Operators from Right Action 


We found a representation of the group action by multiplying the group element from the left into the coset space. We can also get a (conjugate) action by multiplying from the right. Because we like to keep track of signs, we will multiply from the right by $g^{-1}$; then, since $(g_2 g_1)^{-1} = (g_1)^{-1}(g_2)^{-1}$, the composition law and the subgroup action are identical.

Now, when the commutators (anticommuting parameters are used when 
needed) of the coset space generators close onto a subset, which commute to 
zero with the original set, then the situation is very simple and we can easily 
find operators (useful ones) that act from the left on the coset space to give 
the right multiplication action. This is easier than it sounds. Here 


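$$[P_\mu, P_\nu] = 0 \tag{14.153}$$
$$[P_\mu, Q_\alpha] = 0 \tag{14.154}$$
$$\{Q_\alpha, Q_\beta\} = 0 \tag{14.155}$$
$$\{Q_\alpha, \bar Q_{\dot\alpha}\} = 2(\sigma^\mu)_{\alpha\dot\alpha} P_\mu. \tag{14.156}$$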


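Hence, if we define our action as described, we have (abbreviating $(x,\theta,\bar\theta) \equiv x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha}$, $(\theta,\bar\theta) \equiv \theta^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha}$, and $(\xi,\bar\xi) \equiv \xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}$)

$$\begin{aligned}
M(x,\theta,\bar\theta)\, g^{-1}(0,\xi,\bar\xi)
&= \exp i\left(x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha}\right)\exp i\left(\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}\right) \\
&= e^{i(x,\theta,\bar\theta)}\, e^{i(0,-\xi,-\bar\xi)} \\
&= e^{i(x,\theta,\bar\theta)}\, e^{i(0,-\xi,-\bar\xi)}\, e^{-i(x,\theta,\bar\theta)}\, e^{i(x,\theta,\bar\theta)} \\
&= \exp\left\{\sum_{n=0}^{\infty}\frac{1}{n!}\Big[i(x,\theta,\bar\theta),\big[i(x,\theta,\bar\theta),\ldots\big[i(x,\theta,\bar\theta),\, i(0,-\xi,-\bar\xi)\big]\ldots\big]\Big]\right\} e^{i(x,\theta,\bar\theta)} \\
&= \exp\left\{i(\xi,\bar\xi) - i\big[(\theta,\bar\theta),\, i(\xi,\bar\xi)\big] + \text{zero}\right\} e^{i(x,\theta,\bar\theta)} \\
&= \exp\left\{i\left(\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}\right) - \left(\xi^\alpha\bar\theta^{\dot\alpha} + \bar\xi^{\dot\alpha}\theta^\alpha\right) 2(\sigma^\mu)_{\alpha\dot\alpha} P_\mu\right\} e^{i(x,\theta,\bar\theta)} \\
&= \exp\left\{i\left(\xi^\alpha \hat Q_\alpha + \hat{\bar Q}_{\dot\alpha}\bar\xi^{\dot\alpha}\right)\right\}\exp\left\{i\left(x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha}\right)\right\}
\end{aligned} \tag{14.157}$$

where we have used

$$\hat Q_\alpha = Q_\alpha + 2i\bar\theta^{\dot\alpha}(\sigma^\mu)_{\alpha\dot\alpha} P_\mu \tag{14.158}$$
$$\hat{\bar Q}_{\dot\alpha} = \bar Q_{\dot\alpha} - 2i\theta^\alpha(\sigma^\mu)_{\alpha\dot\alpha} P_\mu. \tag{14.159}$$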


Now, although this came out as expected, we may wonder where all this is going. Just see what associativity of the group law gives; $\xi$ and $\eta$ are parameters


e 69-05 [e eru] 


= gi 6+0 M(z') 
= M(z’)eiEQ+Q8) 


fe eere vain) eilfQ+QE) 


= p-i(nQ+Qi) [ Mizje"#2+28 | 


= gin Oy) feia) o) (14.160) 


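Now compare this to the start, and notice that $\eta$ and $\xi$ are arbitrary, so that

$$\{\hat Q, Q\} = 0 = \{\hat Q, \bar Q\}. \tag{14.161}$$

Hence

$$D_\alpha = +i\hat Q_\alpha = \partial_\alpha - i\bar\theta^{\dot\alpha}(\sigma^\mu)_{\alpha\dot\alpha}\partial_\mu = \partial_\alpha - i\bar\theta^{\dot\alpha}\partial_{\alpha\dot\alpha} \tag{14.162}$$

and

$$\bar D_{\dot\alpha} = -i\hat{\bar Q}_{\dot\alpha} = \bar\partial_{\dot\alpha} - i\theta^\alpha(\sigma^\mu)_{\alpha\dot\alpha}\partial_\mu = \bar\partial_{\dot\alpha} - i\theta^\alpha\partial_{\alpha\dot\alpha} \tag{14.163}$$

are covariant derivative operators. Note that $\partial_\mu = \partial/\partial x^\mu$ was already known to be fine, since $[P_\mu, Q_\alpha] = 0$.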

Of course, we could have discovered this directly but the treatment here 
reveals the underlying motivation, and will be useful as a model for other 
ideas (in compactification) coming later. 
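
The statement that left-action and right-action operators anticommute can be tried out on a toy model. The sketch below is not the four-dimensional construction of the text: it assumes a single time coordinate `t` and a single Grassmann coordinate `theta`, with the hypothetical operators `Q = d/dtheta + i*theta*d/dt` and `D = d/dtheta - i*theta*d/dt` acting on `a(t) + theta*b(t)`, where `a` and `b` are ordinary polynomials stored as coefficient lists.

```python
# Toy one-dimensional superspace (an assumption for illustration only):
# a "superfield" is a pair (a, b) standing for a(t) + theta * b(t),
# with a and b polynomials in t given as coefficient lists [c0, c1, c2, ...].

def pderiv(p):
    """d/dt of a polynomial."""
    return [k * c for k, c in enumerate(p)][1:] or [0]

def pscale(p, s):
    return [s * c for c in p]

def padd(p, q):
    n = max(len(p), len(q))
    p = list(p) + [0] * (n - len(p))
    q = list(q) + [0] * (n - len(q))
    return [x + y for x, y in zip(p, q)]

def pzero(p, tol=1e-12):
    return all(abs(c) < tol for c in p)

def Q(f):
    a, b = f
    # (d/dtheta + i theta d/dt)(a + theta b) = b + theta * (i a'), since theta*theta = 0
    return (b, pscale(pderiv(a), 1j))

def D(f):
    a, b = f
    # (d/dtheta - i theta d/dt)(a + theta b) = b - theta * (i a')
    return (b, pscale(pderiv(a), -1j))

f = ([1, 2, 3], [0, 1, 0, 4])               # a = 1 + 2t + 3t^2, b = t + 4t^3

qd, dq = Q(D(f)), D(Q(f))
anticommutator = (padd(qd[0], dq[0]), padd(qd[1], dq[1]))   # {Q, D} f

qq = Q(Q(f))                                 # Q^2 f, expected to equal i d/dt f
qq_minus_idt = (padd(qq[0], pscale(pderiv(f[0]), -1j)),
                padd(qq[1], pscale(pderiv(f[1]), -1j)))
```

In this toy model `{Q, D}` vanishes on every superfield while `Q` squares to a translation, which mirrors the roles played by $Q$, $D$ and $P_\mu$ in the discussion above.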

We finally compute (a trivial exercise) directly that

$$\{D_\alpha, \bar D_{\dot\beta}\} = -2(\sigma^\mu)_{\alpha\dot\beta}(i\partial_\mu) \tag{14.164}$$
$$= -2i\partial_{\alpha\dot\beta} \tag{14.165}$$
$$= -2(\sigma^\mu)_{\alpha\dot\beta} P_\mu. \tag{14.166}$$

This is frequently useful and also reveals the freedom of the curious representation of $Q_\alpha$ used by Wess and Zumino [9].
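
The anticommutator can be checked in a few lines. The sketch below assumes the operators of Equations (14.162) and (14.163), reads Equation (14.144) as $\{\partial_\alpha, \theta^\beta\} = \delta_\alpha^\beta$ (and similarly for the dotted case), and uses the fact that the Grassmann derivatives anticommute with the other odd coordinates; the dummy indices $\gamma$ and $\dot\gamma$ are introduced here only for bookkeeping.

```latex
\begin{aligned}
\{D_\alpha, \bar D_{\dot\beta}\}
&= \{\partial_\alpha, \bar\partial_{\dot\beta}\}
 - i\,\{\partial_\alpha, \theta^\gamma\}\,(\sigma^\mu)_{\gamma\dot\beta}\,\partial_\mu
 - i\,\{\bar\theta^{\dot\gamma}, \bar\partial_{\dot\beta}\}\,(\sigma^\mu)_{\alpha\dot\gamma}\,\partial_\mu
 - \{\bar\theta^{\dot\gamma}, \theta^\gamma\}\,(\sigma^\mu)_{\alpha\dot\gamma}(\sigma^\nu)_{\gamma\dot\beta}\,\partial_\mu\partial_\nu \\
&= 0 - i\,\delta_\alpha^\gamma\,(\sigma^\mu)_{\gamma\dot\beta}\,\partial_\mu
     - i\,\delta_{\dot\beta}^{\dot\gamma}\,(\sigma^\mu)_{\alpha\dot\gamma}\,\partial_\mu - 0 \\
&= -2i\,(\sigma^\mu)_{\alpha\dot\beta}\,\partial_\mu .
\end{aligned}
```

With $P_\mu = i\partial_\mu$ from Equation (14.139), the last line is $-2(\sigma^\mu)_{\alpha\dot\beta}P_\mu$, in agreement with Equation (14.166).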


Superfields 


What are these scalar fields $\Phi(x, \theta, \bar\theta)$ in this curious space? We have implied their existence, but done little else with them. The basic point is that powers of odd (fermionic) objects eventually terminate (nilpotency). For example,


1 
0,05 = (2564407 (14.167) 


with 0? — 0*0, 


is fine, but 


6,050, — 0 (14.168) 
by antisymmetry. Similarly 
zénB o l agg? 
6*9" = 50 (14.169) 
with 8^ — 8,9" 


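is as far as we can go again. So superfields $\Phi(x, \theta, \bar\theta)$ are (complex-number, Grassmann-valued) functions defined on (subsets of) M, which must be understood in terms of their power series expansion in $\theta^\alpha$ and $\bar\theta^{\dot\alpha}$

$$\begin{aligned}
\Phi(x, \theta, \bar\theta) = {}& f(x) + \theta^\alpha\phi_\alpha(x) + \bar\theta_{\dot\alpha}\bar\chi^{\dot\alpha}(x) + \theta^2 m(x) + \bar\theta^2 n(x) \\
&+ \theta_\alpha\bar\theta_{\dot\beta} V^{\alpha\dot\beta}(x) + \theta^2\bar\theta_{\dot\alpha}\bar\lambda^{\dot\alpha}(x) + \bar\theta^2\theta^\alpha\psi_\alpha(x) + \theta^2\bar\theta^2 D(x)
\end{aligned} \tag{14.170}$$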


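and we notice that spins up to unity can occur in a single superfield (without Lorentz indices). Note also that our definition of differentiation with respect to spinor coordinates is

$$\Phi(x; \theta + \delta\theta, \bar\theta + \delta\bar\theta) = \Phi(x; \theta, \bar\theta) + \left(\delta\theta^\alpha\partial_\alpha + \delta\bar\theta^{\dot\alpha}\bar\partial_{\dot\alpha}\right)\Phi(x; \theta, \bar\theta) \tag{14.171}$$

and that there is a graded Leibnitz rule

$$\partial_\alpha(AB) = (\partial_\alpha A)B + (-1)^{|A|} A(\partial_\alpha B) \tag{14.172}$$

where $|A| = 0$ if $A$ is bosonic and $|A| = 1$ if $A$ is fermionic.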
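
The termination of the power series can be made concrete with a small Grassmann-algebra sketch. It is only an illustration: the labels `0, 1` for the two components of $\theta$ and `2, 3` for the two components of $\bar\theta$, and the helper names `gen` and `gmul`, are choices made here and do not come from the text.

```python
from itertools import combinations

# An element of the Grassmann algebra is a dict {monomial: coefficient},
# where a monomial is a sorted tuple of generator labels.
# Labels (an assumption): 0, 1 -> theta^1, theta^2; 2, 3 -> thetabar^1, thetabar^2.

def gen(i):
    return {(i,): 1}

def gmul(x, y):
    out = {}
    for mx, cx in x.items():
        for my, cy in y.items():
            if set(mx) & set(my):
                continue                      # a repeated generator squares to zero
            merged = list(mx + my)
            sign = 1
            for i in range(len(merged)):      # parity of the sorting permutation
                for j in range(i + 1, len(merged)):
                    if merged[i] > merged[j]:
                        sign = -sign
            key = tuple(sorted(merged))
            out[key] = out.get(key, 0) + sign * cx * cy
    return {k: v for k, v in out.items() if v != 0}

t1, t2 = gen(0), gen(1)

# Every product of three undotted thetas vanishes, as in Equation (14.168).
triple_products = [gmul(gmul(gen(i), gen(j)), gen(k))
                   for i in (0, 1) for j in (0, 1) for k in (0, 1)]

# Independent monomials in theta and thetabar: one per subset of the 4 generators.
n_monomials = sum(1 for r in range(5) for _ in combinations(range(4), r))
```

The count `n_monomials` is 16, which matches the number of component functions in Equation (14.170) when the spinor and vector components are counted individually (1 + 2 + 2 + 1 + 1 + 4 + 2 + 2 + 1).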
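Supertransformations

We can now rederive results previously obtained. Consider the full field

$$\Phi = f(x) + \cdots + \theta^2\bar\theta^2 D(x) \tag{14.173}$$

and its variation $\delta\Phi$, where

$$\delta = -i\left(\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}\right) = -\left\{\xi^\alpha\left[\partial_\alpha + i\bar\theta^{\dot\alpha}(\sigma^\mu)_{\alpha\dot\alpha}\partial_\mu\right] + \bar\xi^{\dot\alpha}\left[\bar\partial_{\dot\alpha} + i\theta^\alpha(\sigma^\mu)_{\alpha\dot\alpha}\partial_\mu\right]\right\}$$

and work this out to equate to

$$\delta_\xi f(x) + \cdots + \theta^2\bar\theta^2\left(\delta_\xi D(x)\right) \tag{14.174}$$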


and thus find the changes in the component fields. We will just work out one 
piece 


08^ 5, D(x) = (—)£*i8 (o^),52, (028535 (x)] 
— E" i6 (o") pad, [8 0° We(x)}. (14.175) 


1 
Now 06°68 = (-)58 0? 


te) 
5 
a 
g. 
B. 
D 
=F 
< 
DI 
e 
S 
e 
Il 
£ 


Apologies to other authors, but the notation here allows our conjugation of 
equations—so NOW &4, = E«p(= Eag numerically), and dotted indices are 
raised and lowered exactly as undotted ones. 
Hence 
028; D(x) = £* (0a D DAP) +E" 5(o" pas 078 9^. (x) 
i 52 a É zP a 
= 50°8 (c^), [E Af +E ony} (14.176) 
i a » f zË a 
or 6D(x) = AG AE +E y) (14.177) 


since the parameters are constant. Notice that this is a total derivative. 


Notes 


1. The product of two superfields is (trivially) a superfield, so that superfields are like the tensors of supersymmetry.

2. In general, a superfield will be reducible. We find the irreducible representations by imposing constraints that do not yield equations of motion in four-dimensional space-time. These can be reality conditions, or $\bar D\Phi = 0$ type conditions, as we shall see. There is so much new notation to confuse us that we had better make contact with something simple, then go on later to see its full beauty and power.

The Chiral Scalar Multiplet 
We shall show that the condition 

Dy =0 (14.178) 


gives exactly an irreducible multiplet. Recall that 


Dg = E» + i0 P(o) gadu (14.179) 
so that D40’ =0 (14.180) 
and Dyy” =0 (14.181) 


where y" — x" —i(o") 0°” 


follows trivially. This means that any function $\Phi(y, \theta)$ is a solution of the constraint. We can expand this as

$$\Phi = A(y) - 2\theta^\alpha\psi_\alpha(y) - \theta^2 F(y) \tag{14.182}$$

with the notation previously introduced; the factors and signs are designed to give exact contact with Wess and Zumino. Now it would be convenient to express this back in terms of $x$, $\theta$, and $\bar\theta$.

Notice that 


(—i0 B (o"),59,]x" = —i0*8^ (a^), y (14.183) 
so that ®(y, 0) = exp (— i08 d p) [A(x) — 26*y.(x) — 8? F (x)) 
B ORA y 
= (1-10°0",5— TATI [A — 20 ^y, — 8? F) 


(14.184) 


_ AQ) — 20 ^y, (x) — 0? F (x) — i0"8^2,; A(x) 
xi6?8^3,, (x) — 1076 OAK) 


(14.185) 


where we have used 9, pa? = 20. Clearly this has the same content we had 
previously. Now we must check that the transformation rules are the same. 
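We just work out a supercharge

$$\delta_\xi\Phi = -i\left[\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}\right]\Phi \tag{14.186}$$

$$= -\left\{\xi^\alpha\left(\partial_\alpha + i\bar\theta^{\dot\alpha}\partial_{\alpha\dot\alpha}\right) + \bar\xi^{\dot\alpha}\left(\bar\partial_{\dot\alpha} + i\theta^\alpha\partial_{\alpha\dot\alpha}\right)\right\}\{\text{Eq. (14.185) above}\} \tag{14.187}$$

$$= 2\xi^\alpha\psi_\alpha(x) + 2\xi^\alpha\theta_\alpha F(x) - i\bar\xi^{\dot\alpha}\theta^\alpha\partial_{\alpha\dot\alpha}A(x) \tag{14.188}$$

$$\quad - i\bar\xi^{\dot\alpha}\theta^2\partial_{\alpha\dot\alpha}\psi^\alpha(x) - \bar\xi^{\dot\alpha}\, i\theta^\beta\partial_{\beta\dot\alpha}\left[A(x) - 2\theta^\alpha\psi_\alpha(x)\right] + \text{terms in } \bar\theta. \tag{14.189}$$

(We used $\partial_\alpha\theta^2 = 2\theta_\alpha$.)

$$= 2\xi^\alpha\psi_\alpha(x) - 2\theta^\alpha\left[-i\bar\xi^{\dot\alpha}\partial_{\alpha\dot\alpha}A(x) - \xi_\alpha F(x)\right] - \theta^2\, 2i\bar\xi^{\dot\alpha}\partial_{\alpha\dot\alpha}\psi^\alpha(x) + \text{terms in } \bar\theta. \tag{14.190}$$

Hence

$$\delta_\xi A(x) = 2\xi^\alpha\psi_\alpha(x) \tag{14.191}$$
$$\delta_\xi\psi_\alpha(x) = -i\bar\xi^{\dot\alpha}\partial_{\alpha\dot\alpha}A(x) - \xi_\alpha F(x) \tag{14.192}$$
$$\delta_\xi F(x) = 2i\bar\xi^{\dot\alpha}\partial_{\alpha\dot\alpha}\psi^\alpha(x) \tag{14.193}$$

and we have confirmed the Wess and Zumino result. Notice the familiar total divergence in $\delta_\xi F$.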


Superspace Methods 


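We are now going to exhibit some of the power of these methods. First, we record some very useful identities that we can check directly and quickly:

$$\{D_\alpha, D_\beta\} = 0 \tag{14.194}$$
$$[D_\alpha, \bar D^2] = -4i\partial_{\alpha\dot\alpha}\bar D^{\dot\alpha} \tag{14.195}$$
$$\{D_\alpha, \bar D_{\dot\beta}\} = -2i\partial_{\alpha\dot\beta} \tag{14.196}$$
$$[\bar D_{\dot\alpha}, D^2] = 4i\partial_{\alpha\dot\alpha}D^\alpha \tag{14.197}$$
$$\{\bar D_{\dot\alpha}, \bar D_{\dot\beta}\} = 0 \tag{14.198}$$
$$[D^2, \bar D^2] = 4i\partial_{\alpha\dot\alpha}[D^\alpha, \bar D^{\dot\alpha}] \tag{14.199}$$
$$D^2\bar D^2 D^2 = -16\,\Box D^2 \tag{14.200}$$
$$\bar D^2 D^2\bar D^2 = -16\,\Box\bar D^2. \tag{14.201}$$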
On a chiral superfield, when

$$\bar D_{\dot\alpha}\Phi = 0, \tag{14.202}$$
$$\bar D_{\dot\alpha}D^2\Phi = 4i\partial_{\alpha\dot\alpha}D^\alpha\Phi \tag{14.203}$$
and
$$\bar D^2 D^2\Phi = -16\,\Box\Phi. \tag{14.204}$$


Covariant Definition of Component Fields 


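We know that $\partial_\alpha = \partial/\partial\theta^\alpha$ is not covariant. We can improve on this by using $D_\alpha$, and can satisfy differential constraints automatically. For example, take our chiral field with $\bar D_{\dot\alpha}\Phi = 0$. Define (with previous notation in mind)

$$\Phi\big|_{\theta = 0 = \bar\theta} = A \tag{14.205}$$
$$D_\alpha\Phi\big|_0 = -2\psi_\alpha \tag{14.206}$$
$$D^2\Phi\big|_0 = 4F. \tag{14.207}$$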


Supercharges Revisited 


We can work out the supercharge much more easily now. Notice that 


Dalo = i Qu, (14.208) 

= Qulo (14.209) 

and Dalo = (—)iQelo (14.210) 
= —iQ,lo, (14.211) 


which allows us to use our identities. (Warning: Naturally Dy Q;lo # Dai Djlo, 
so we must take Q and Q to the left of all derivatives before switching to D 
and D.) Then, for example, 


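$$\delta_\xi A = \delta_\xi\Phi\big|_0 \tag{14.212}$$
$$= (-i)\left(\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}\right)\Phi\big|_0 \tag{14.213}$$
$$= \left(-\xi^\alpha D_\alpha + \bar D_{\dot\alpha}\bar\xi^{\dot\alpha}\right)\Phi\big|_0 \tag{14.214}$$
$$= -\xi^\alpha D_\alpha\Phi\big|_0 \tag{14.215}$$
$$= 2\xi^\alpha\psi_\alpha(x), \tag{14.216}$$

which is the Wess and Zumino result yet again.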


Again ôy = (5 Ae (14.217) 
0 
= D,(-i)[£^Qs +g je (14.218) 
= Sl Qs + QF 1D.& (this step is essential) (14.219) 
0 
1 ee: 
= 518? Dp - DID (14.220) 
1 B lop 5 
= 56 DpDa®| + 58 (Dj, Da}? (14.221) 
0 0 
1 1 E 
= £8 {_= 2m] _ 7EP9 , 
= 58 ( ;) Epa D o| ig age, (14.222) 
= -&F — iẸ dng A (14.223) 


and we confirm the previous result. 


1 
Finally 6;F = ;D o (14.224) 
0 
1 2 
= zô DD (14.225) 
4 0 
T. — Ó 
= 10)0l^Q. + Qag JDO (14.226) 
0 
1- l 
= (4E D,D'e since D =0 (14.227) 
1.j = 
= (3i aa Dto ... since Dj — 0 (14.228) 
= iF? 9, P 
= 2iẸ agg (14.229) 


yet again. Clearly these methods are rather clean and powerful. Now it is time 
to rediscover the invariant Lagrangian of Wess and Zumino. We need the idea 
of superspace integration [11]. We will start in a one-dimensional case, where 


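$$f(x, \theta) = f_0(x) + f_1(x)\theta \tag{14.230}$$

and demand linearity, so that

$$\int d\theta\, f = \int d\theta\, f_0 + \int d\theta\, f_1\theta = I_0 f_0 + I_1 f_1 \tag{14.231}$$

where $I_0$ and $I_1$ are (universal) constants. We then demand translational invariance, so that

$$\int d\theta\, f(\theta) = \int d\theta\, f(\theta + \epsilon) \tag{14.232}$$
$$= I_0(f_0 + f_1\epsilon) + I_1 f_1 \tag{14.233}$$

and have $I_0 = 0$. This means that

$$\int d\theta\,(f_0 + f_1\theta) = f_1 I_1 \tag{14.234}$$

and

$$\int d\theta \sim \partial/\partial\theta. \tag{14.235}$$

The integral $I_1$ is conventionally taken as unity, so that

$$\int d\theta\,\theta = 1. \tag{14.236}$$

Also

$$\int d\theta\,\frac{\partial f}{\partial\theta} = 0 \tag{14.237}$$

so integration by parts works while

$$\int d\theta\,\theta f(\theta) = f_0 \tag{14.238}$$
$$= f(\theta = 0) \tag{14.239}$$

so that

$$\delta(\theta) = \theta. \tag{14.240}$$

Notice that

$$\delta(\theta)\delta(\theta) = 0. \tag{14.241}$$

We define

$$\int d^2\theta = \int \epsilon_{\alpha\beta}\, d\theta^\alpha d\theta^\beta \tag{14.242}$$

and

$$\int d^2\bar\theta = \int \epsilon_{\dot\alpha\dot\beta}\, d\bar\theta^{\dot\alpha} d\bar\theta^{\dot\beta} \tag{14.243}$$

with

$$\int d^4\theta = \int d^2\theta\, d^2\bar\theta. \tag{14.244}$$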
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[ee f(0) = [eena0%40° (fo — 20” fy, — 8? Fo] (14.245) 


= 0404 f2)Eug f do*de’{ — «,,0°0"} (14.246) 
= (-)4 fo f de 04909090 (14.247) 
= 4f. (14.248) 
But recall 

9? f — 9*3, f (14.249) 
= 0 + 0 + e"5352, (2) f0’) (14.250) 
= (—) fe dg {20a} (14.251) 
= (—)2 frye” eyaðg0 Y (14.252) 
=4f (14.253) 

also. Hence

$$\int d^2\theta\, f = \partial^2 f. \tag{14.254}$$
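
The one-dimensional rules above are simple enough to try out directly. The following sketch is a bookkeeping illustration under stated assumptions: a function $f_0 + f_1\theta$ is stored as the pair `(f0, f1)` with ordinary numerical coefficients, and the shift parameter `eps` is treated as a plain number that merely tracks where $\epsilon$ would appear. The helper names are hypothetical.

```python
# f = (f0, f1) stands for f0 + f1 * theta, with theta * theta = 0.

def functional(f, I0, I1):
    """The most general linear functional: I0 * f0 + I1 * f1."""
    return I0 * f[0] + I1 * f[1]

def shift(f, eps):
    """f(theta + eps) = (f0 + f1 * eps) + f1 * theta."""
    return (f[0] + f[1] * eps, f[1])

def berezin(f):
    """Translation-invariant choice I0 = 0, normalised with I1 = 1."""
    return functional(f, 0.0, 1.0)

def d_theta(f):
    """d/dtheta of f0 + f1 * theta is the constant f1."""
    return (f[1], 0.0)

def times_theta(f):
    """theta * (f0 + f1 * theta) = f0 * theta, because theta * theta = 0."""
    return (0.0, f[0])

f = (2.0, 5.0)
eps = 0.3

invariant = berezin(shift(f, eps)) - berezin(f)                      # vanishes
not_invariant = functional(shift(f, eps), 1.0, 1.0) - functional(f, 1.0, 1.0)
total_derivative = berezin(d_theta(f))                               # vanishes
delta_property = berezin(times_theta(f))                             # equals f(theta = 0)
```

Any nonzero `I0` spoils translation invariance (the difference is `I0 * f1 * eps`), while with `I0 = 0` the integral of a derivative vanishes and multiplication by `theta` acts like a delta function.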


Invariants and Lagrangians

Consider the invariant

$$I = \int d^4x\, d^2\theta\, d^2\bar\theta\; L(\Phi, D_\alpha\Phi, \cdots), \tag{14.255}$$

which is index saturated in the usual way. We avoid Taylor expansion (although it is sometimes easier) by noticing that

$$\int d\theta^\alpha = \partial_\alpha = D_\alpha \quad \text{under } \int d^4x, \text{ by throwing divergences away.} \tag{14.256}$$

Then

$$I = \int d^4x\; D^2\bar D^2 L\,\Big|_0 \tag{14.257}$$

and we evaluate this integral by use of our identities and component definitions. For example, with chiral again,


I- Í dfx d0 © (14.258) 
= f dix D D Oo (14.259) 
0 
zie / d^xD Doo (f a2 > 0) (14.260) 
0 
= (-) / d‘xD $Do (since D? $ = 0) (14.261) 
0 
-oes (6 D D 6 £2(D,9)D' D? & + (D^ $)D? oj 
(14.262) 


=(—) [a*x((-)169 00+ 2(D,) 4ie**a,, DY &+ (D^$)(D^0)) 


0 
(14.263) 


=(-) [atx (162 A+ 8120 tP, 4(—)2” + 16F F} (14.264) 


since 6 = A-2y,0 -F0 > D46 —2y, 


= (16 f 4*3 A+ 21,0 ys + FF}. (14.265) 


This is (—32) times what is usually called the Wess and Zumino model La- 
grangian. Often it is put into the Wess-Zumino language by writing 


A > A+iB " 
and y e (5x) (14.266) 
Bp baie y 
so that 
y y -—y E pu y (14.267) 
= 2y 9^? yr, (14.268) 
we get I = (32) f ats Iste. + (3,BY] - iv gy + E? + e 


(14.269) 


— (—32). (Usual form of Wess-Zumino model Lagrangian.) 


Notice that when we write invariant Lagrangians like this in supersymme- 
try, they are really only invariant up to a total derivative which is then thrown 
away under an integral. This is exactly the way the D-term transformed in 
the full scalar case, and the F -term transformed in the chiral case. 
At any rate we can see that Equation (14.265) contains kinetic-like terms,
and we take


Lo = (-5) pars d‘0(®) (14.270) 


as the first part of our general chiral scalar Lagrangian. We seek mass terms, 
and interaction terms. Consider 


l= fas d*o P(®) (14.271) 


where $P$ is polynomial and therefore chiral if $\Phi$ is chiral. This is invariant. Indeed $\int d^4x \int d^4\theta\, P = \int d^4x\, d^2\theta\, \bar D^2 P = 0$, or more generally, $\int d^4x\, d^4\theta\, Z = \int d^4x\, d^2\theta\, (\bar D^2 Z)$ where $Z$ is general but $(\bar D^2 Z)$ is chiral because $\bar D^3 = 0$. Then


I- f d^x D?P (14.272) 
0 
= | d*x D* D,P (14.273) 
0 
= l dfx D°P'D,® (14.274) 
= j d*x[P"(D,9)? + P'D?9] (14.275) 
0 
= a fats [P"(A)y? + P'(A)F]. (14.276) 
Now we can see what to do. We write 
(o T 4 20 $2 25 8? 
Lea i) fats [f[ 009 ELLE (14.277) 
m 4 2 T2 LATE 
=(-5) dx {w24+ AF +H + AF} (14.278) 
then with 
geal ey es pA (14.279) 
F > F-iG[' y ` 
as before, 
1— 
Ln = (=m) f atx [Sv + AF + Bc) (14.280) 


and we have our mass terms. 
Finally we need interaction terms (not too high in powers of scalars, or we
face nonrenormalizability), and we try


NER AM 4 2m3 1 Wa? 
Ed £) fate (De +D) (14.281) 
= (-5) | d*x A(6AJ? - 3E +6AY c 3A F} (14.282) 
= cofets {W(A+ Bys)y + (4 — B?)F +2ABG} (14.283) 


when we make the, by now familiar, substitutions. 


Notes 


1. 
1 4 2 1 4 2 
i] 4x49 1 | xD (14.284) 


picks out the F -component of a chiral integrand. 
2; 


D'D'o- D*D,D'o (14.285) 


= D*[D'D, — 4i3,4D^ |o (14.286) 


= D*D'D,oó + fo under / es] (14.287) 
so x fats do = x fat dD’ (14.288) 
16 16 l 


1 m 
== f d*x D° D’ D, (14.289) 


picks out the D-component of an integrand. This latter form is par- 
ticularly heavily used, as it gives a convenient order of terms when 
dealing with gauge invariance. 

3. Z general > DZ is chiral. 


4. A product of two chiral fields is again chiral, by the Leibnitz rule. To 
find the components 


As = Aı - Azlo (14.290) 
= AA (14.291) 
then the real form is


As = Ay Ay — B4B; (14.292) 
B; = AB; — By Ap (14.293) 
1 
Voja = (—) z De - Ao (14.294) 
0 

1 1 

=(-) 5(DaAi) Ar + At (-50«4:) (14.295) 
0 

= Vo A2 + A1 rojo (14.296) 
Way = (4 — ysBi) Wy + (Ao — ys Ba) Way. (14.297) 


Superpotential 


There may, of course, be several chiral scalars $\Phi_i$ and we can now write the general Lagrangian for chiral superfields. There is a kinetic term obtained simply by summing over the fields


Lo= 25 (-) jo d^0 ($0). (14.298) 


Then there are mass terms, and interaction terms, which are generally grouped 
together into what is called the superpotential W(®). 

In our language we have the extra part of the Lagrangian (with m;; and gijx 
real and symmetric) as 


1 
(-3) pars d?0 W(®) + h.c. (14.299) 
where
$$W(\Phi) = \frac{1}{2}m_{ij}\Phi_i\Phi_j + \frac{1}{3}g_{ijk}\Phi_i\Phi_j\Phi_k. \tag{14.300}$$


Writing ¢ where we previously wrote A, to standardize the notation, the 
whole thing is described by 


£ — 9,0109" di + 2iy;o 79, V; + F] F; + mijGiF; 


1 
= mij Vid + SiikPi Pj Fe — gijk Vivid + h.c.. (14.301) 


222 Group Theory for the Standard Model of Particle Physics and Beyond 


The Euler-Lagrange field equations from this Lagrangian are 


9,9 ó; = mijF| + 2gij dI Fi (14.302) 
io "9, y; = -miji — 2gijk V jo] (14.303) 
$$F_i^\dagger = -m_{ij}\phi_j - g_{ijk}\phi_j\phi_k \tag{14.304}$$
$$= -\frac{\partial W(\phi)}{\partial\phi_i} \tag{14.305}$$

where $W(\phi)$ is the superpotential with $\Phi_i \to \phi_i$. The last equation of motion, Equation (14.304), confirms that the $F_i$ are auxiliary fields (no derivatives appear) and they may be algebraically removed. Then


£ — 9,019" d; +i YT "3, y; — FF; 


1 
= E Vi UE + gijkiVvior + he.) (14.306) 
= 9,019" d; +i YT 9, V; — mijó; + gijkó;di|? 
1 
= (mdi + Gijk Wil ior + he.) : (14.307) 


Notice that the tree level effective potential is

$$V = F_i^\dagger F_i \tag{14.308}$$
$$= |F_i|^2 \tag{14.309}$$

which is always $\geq 0$. Absolute minima of $V$ are where $F_i = 0$. Switching back to the real form (with Majorana fermions) this Lagrangian reads


=a, pld" o] — S. Wi — jus, vj - SE Rin — Firs ile 
- Ew, + Visi (14.310) 


which most phenomenologists prefer, since they are happier with Dirac 
y-matrices and not with dotted and undotted Weyl spinors. The Feynman 
rules are normally given in this form. 
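
The statement that the tree-level potential is a sum of squares is easy to explore numerically. The sketch below assumes the auxiliary-field solution $F_i^\dagger = -(m_{ij}\phi_j + g_{ijk}\phi_j\phi_k)$ of Equation (14.304); the single-field values `m = 2.0` and `g = 0.5` and the function names are illustrative choices, not taken from the text.

```python
# Tree-level potential V = sum_i |F_i|^2 with F_i^dagger = -(m_ij phi_j + g_ijk phi_j phi_k).

def f_dagger(phi, m, g):
    n = len(phi)
    return [-(sum(m[i][j] * phi[j] for j in range(n))
              + sum(g[i][j][k] * phi[j] * phi[k]
                    for j in range(n) for k in range(n)))
            for i in range(n)]

def potential(phi, m, g):
    return sum(abs(f) ** 2 for f in f_dagger(phi, m, g))

# One chiral field (illustrative numbers): W = (1/2) m phi^2 + (1/3) g phi^3.
m = [[2.0]]
g = [[[0.5]]]

v_origin = potential([0.0], m, g)             # F = 0 at phi = 0
v_second = potential([-4.0], m, g)            # F = 0 again at phi = -m/g
v_generic = potential([1.0 + 2.0j], m, g)     # strictly positive elsewhere
```

For this one-field example the two zeros of $F$ sit at $\phi = 0$ and $\phi = -m/g$, and both are absolute minima with $V = 0$.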

Despite many opportunities supersymmetry ignored them all, and more 
or less was dragged protesting into our four-dimensional world in 1974 by 
Wess and Zumino [1, 2, 9] who were referring to a paper by Ramond [3] 
aiming to introduce particles with half-integer spin into a string theory and 
also referring to a paper by Neveu and Schwarz [4] suggesting the addition 
of d fermionic field doublets. At any rate Wess and Zumino constructed several supersymmetric models. The simplest involved a single Majorana (self-charge-conjugate Dirac) field ($\psi$), a pair of real scalar and pseudoscalar boson fields ($A$ and $B$), and a pair of real scalar and pseudoscalar bosonic auxiliary fields ($F$ and $G$). This was invariant under the infinitesimal transformations (the notation has changed over subsequent years)


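$$\delta A = \bar\alpha\psi \tag{14.311}$$
$$\delta B = -i\bar\alpha\gamma_5\psi, \tag{14.312}$$

which are connected to

$$\delta\psi = \partial_\mu(A + i\gamma_5 B)\gamma^\mu\alpha + (F - i\gamma_5 G)\alpha, \tag{14.313}$$
$$\delta F = \bar\alpha\gamma^\mu\partial_\mu\psi, \tag{14.314}$$
$$\delta G = -i\bar\alpha\gamma_5\gamma^\mu\partial_\mu\psi \tag{14.315}$$

where $\alpha$ is an arbitrary constant infinitesimal Majorana fermion c-number parameter. The most general real Lorentz-invariant, parity-conserving, renormalizable Lagrangian density built from these objects is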


$$\mathcal{L} = -\frac{1}{2}\partial_\mu A\,\partial^\mu A - \frac{1}{2}\partial_\mu B\,\partial^\mu B - \frac{1}{2}\bar\psi\gamma^\mu\partial_\mu\psi + \frac{1}{2}(F^2 + G^2) + m\left[FA + GB - \frac{1}{2}\bar\psi\psi\right] + g\left[F(A^2 - B^2) + 2GAB - \bar\psi(A + i\gamma_5 B)\psi\right]. \tag{14.316}$$

Since the auxiliary fields $F$ and $G$ enter quadratically, we can derive an equivalent Lagrangian by setting them equal to the values given by the field equations,

$$F = -mA - g(A^2 - B^2), \tag{14.317}$$
$$G = -mB - 2gAB. \tag{14.318}$$

The Lagrangian density then becomes

$$\mathcal{L} = -\frac{1}{2}\partial_\mu A\,\partial^\mu A - \frac{1}{2}\partial_\mu B\,\partial^\mu B - \frac{1}{2}\bar\psi\gamma^\mu\partial_\mu\psi - \frac{1}{2}m^2(A^2 + B^2) - \frac{1}{2}m\bar\psi\psi - gmA(A^2 + B^2) - \frac{1}{2}g^2(A^2 + B^2)^2 - g\bar\psi(A + i\gamma_5 B)\psi. \tag{14.322}$$




This Lagrangian density exhibits relations not only between scalar and fermion
masses, but also between Yukawa interactions and scalar self-couplings, which
are characteristic of supersymmetric theories.
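
The elimination of the auxiliary fields can be verified numerically. The sketch below assumes the auxiliary-field coupling $g[F(A^2 - B^2) + 2GAB]$, the sign combination that also appears in Equation (14.283), and compares the result of substituting the field equations with the scalar self-interaction terms of Equation (14.322). The function names and sample values are illustrative only.

```python
# Part of the Lagrangian that contains F and G (fermions and kinetic terms dropped):
#   (1/2)(F^2 + G^2) + F*a + G*b,  with  a = m*A + g*(A^2 - B^2),  b = m*B + 2*g*A*B.
# The relative sign in A^2 - B^2 is an assumption taken from Eq. (14.283).

def sources(A, B, m, g):
    return m * A + g * (A * A - B * B), m * B + 2.0 * g * A * B

def aux_part(F, G, A, B, m, g):
    a, b = sources(A, B, m, g)
    return 0.5 * (F * F + G * G) + F * a + G * b

def solve_aux(A, B, m, g):
    """Field equations dL/dF = 0 and dL/dG = 0 give F = -a and G = -b."""
    a, b = sources(A, B, m, g)
    return -a, -b

def scalar_potential(A, B, m, g):
    """Minus the scalar non-derivative terms of Eq. (14.322)."""
    r2 = A * A + B * B
    return 0.5 * m * m * r2 + g * m * A * r2 + 0.5 * g * g * r2 * r2

A, B, m, g = 0.7, -1.3, 2.0, 0.5
F, G = solve_aux(A, B, m, g)
a, b = sources(A, B, m, g)

stationarity = (F + a, G + b)                    # both components vanish
mismatch = -aux_part(F, G, A, B, m, g) - scalar_potential(A, B, m, g)
```

After substitution the auxiliary sector collapses to $-\frac{1}{2}(F^2 + G^2)$ evaluated on the solution, which is exactly the quartic scalar potential; this is how a single pair $(m, g)$ ends up fixing the masses, the Yukawa couplings and the scalar self-couplings together.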

Unknown to Wess and Zumino, at the time of their first papers on super- 
symmetry in four space-time dimensions, this symmetry had already ap- 
peared in a pair of papers published in the Soviet Union. In 1971 Gol'fand 
and Likhtman [5] had extended the algebra of the Poincaré group to a super- 
algebra and used the requirement of invariance under this superalgebra to 
construct supersymmetric field theories in four space-time dimensions. In- 
dependently, Volkov and Akulov [6] in 1973 discovered what we would call 
spontaneously broken supersymmetry, but they used their formalism to iden- 
tify the Goldstone fermion associated with supersymmetry breaking with a 
neutrino. 

Coleman and Mandula [7] proved a celebrated theorem to the effect that the only Lie algebra (as opposed to superalgebra) of symmetry generators consists of the generators $P_\mu$ and $J_{\mu\nu}$ of translations and homogeneous Lorentz transformations, together with possible internal symmetry generators, which commute with $P_\mu$ and $J_{\mu\nu}$, and act on physical states by multiplying them with spin-independent, momentum-independent Hermitian matrices. There are a number of extra conditions that are not needed for our purposes, and there is a generalization to include the remaining Lie algebra of the conformal group by Haag, Lopuszanski, and Sohnius [8]. This publication effectively establishes the result using the elementary particle states, both massive and massless, known at that time.

At this stage of our understanding we do not need to know how to construct all massive and massless representations of supersymmetry. It is worth knowing, however, that there is only one kind of massless supermultiplet in theories with simple supersymmetry. There are no massless particles that are not accompanied by a superpartner or ones that have more than one superpartner. How can the known quarks, leptons, and gauge bosons fit into this picture? We assume that the supersymmetry generator commutes with the generators of the SU(3) x SU(2) x U(1) gauge group. The quarks and leptons cannot be in the same multiplets as the gauge bosons. In the limit of high energy where SU(2) x U(1) symmetry breaking can be neglected, the massless quarks and leptons of each color and flavor are in supermultiplets with pairs of massless squarks and sleptons of zero helicity and the same color and flavor, while the massless gauge bosons are accompanied by massless gauginos of helicity $\pm\frac{1}{2}$ comprising an adjoint representation of SU(3) x SU(2) x U(1).

Because gravity exists we know that there must exist a massless particle of helicity $\pm 2$, the graviton. There are no conserved quantities to which a soft massless particle with $|\lambda| > 2$ could couple. We conclude that the graviton must be in a supermultiplet with a massless particle of helicity $\pm\frac{3}{2}$. This is a gravitino, coupled to the supersymmetry generators themselves. The field theory of this multiplet is known as supergravity.
Problems

14.1 Confirm contact with previous notation by retrieving x; = x; + 
£ijk0jXk for [3] rotations. 

14.2 Repeat Problem 14.1 but for 6) = 6, — £(6; o;)80g lowering the spinor 
index. 

14.3 Check the reality of ôx”. 

14.44 Retrieve: L; > €;jnXj(—10k) + iloi) £0? dq = 1:40 da 
and check directly that [(—)i6 - L]6, = 1(o;)56,65 as it should be to 
induce transformations of fields as imposed. Note that this exercise 
is supposed to draw attention to 6; V° 6,, and to remind us that 
in our notation Cog = ag, but (C1)? = (—)e**, so that (o;) = 
(—)Cow (oi) gr (C^!) d ]. 

14.5 Check Q, and Q, are related by conjugation. (Watch [ougl* = Ope, 
but 9, = (—)9, = (dup) = (—) dpa.) 

14.6 Check $\{Q_\alpha, \bar Q_{\dot\alpha}\} = 2\sigma^\mu_{\alpha\dot\alpha} P_\mu$ directly. (Warning: Watch out for Wess
and Zumino here.)


14.7 Check the statement on subgroup action.


14.8 Work through our Euclidean example. Confirm that rotations are
unchanged, but that for translations $\mathbf{X} \to \mathbf{X} - \mathbf{a}$ under right action
compared to $\mathbf{X} \to \mathbf{X} + \mathbf{a}$ under left action.

14.9 Check $\{D_\alpha, Q_\beta\} = 0$ directly.


14.10 (Trivial) Show D, and D, give Leibnitz rules, then work out D, D;6,93 
a few different ways for practice. 
14.11 Work out the $\bar\theta$ terms and check consistency.
14.12 Check 


D?o|, = D Daha 
= D” BEL 
= D'(-2F6,)|, 
copper 
— AF. 


Notice that, if we go on, we get 


D;,D,o|, = {Da D,)ó| (since D; = 0) 
= (—)2i dva Plo 
= —2i0ss A. 


and the differential constraints are now automatic. 


14.13 Check the result of Problem 14.12 explicitly using the previous 
(explicit) form of ®. 


14.14 Check through the other pieces—obviously D =0= D’, so this 
stops—in agreement with our other expanded form. 


14.15 For example (work through) 


/ 4/87 =4F 
=a f. 
14.16 Work out the things that follow by this method 


F; = Fy Ay + A Po — Vaio + WA) Vora 
F; = Fy Ay + F2 Ay + G1 B2 + B1G5 + Yvo 
G3 = G1% + G24 — Fi By — FB, — VayYsVo- 


14.17 Notice that the Coleman-Mandula theorem deals only with trans- 
formations that take bosons into bosons and fermions into fermions 
and are therefore generated by operators that satisfy commuta- 
tion relations rather than anticommutation relations. This raises the 
question of whether a relativistic theory can have symmetries taking 
fermions and bosons into each other and therefore satisfy anticom- 
mutation relations rather than commutation relations. Show that 
supersymmetry is the only possible solution to this situation. 


14.18 Show that the most general symmetry algebra allowed under the 
assumptions of the Coleman-Mandula theorem in the case where 
all particles are massless consists of internal symmetry generators 
plus either the Poincaré algebra or the conformal algebra. 


14.19 Calculate the change in the Wess and Zumino Lagrangian density 
under the space-time supersymmetry transformation. 
14.20 Find a set of 2 x 2 matrices that form a graded Lie algebra containing
fermionic as well as bosonic generators.

14.21 In 2 space and 1 time dimensions you can take the generators of the
Lorentz group as $A_1 = -iJ_{10}$, $A_2 = -iJ_{20}$, and $A_3 = J_{12}$.
The commutation relations of the Poincaré algebra are $[A_i, A_j] = i\sum_k \epsilon_{ijk} A_k$
so the representations of the homogeneous Lorentz group
in 2 + 1 space-time dimensions are labelled with a single positive integer
or half-integer A. Following the approach of Haag, Lopuszanski, and Sohnius
derive the most general symmetry that can be sustained.